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I.  INTRODUCTION 


During  the  period  June  1,  1978  to  May  13,  1982,  the  research  "Fault 
Diagnosis  of  Large  Scale  Analog  Systems"  is  sponsored  by  the  Office  of  Naval 
Research  under  Contract  Number  N00014-78-C-04444.  This  research  is  undertaken 
at  the  University  of  Notre  Dame,  Notre  Dame,  Indiana  under  the  direction  of 
the  Principal  Investigator,  Dr.  Ruey-wen  Mu.  The  long-term  objective  of  this 
research  is  to  develop  a  practical  and  reliable  Automatic  Test  Program  Gener¬ 
ator  (ATPG)  which  will  allow  us  to  locate  the  faulty  component(s)  of  a  large 
analog  circuit  when  it  is  faulty.  The  short-term  objective  is  to  search  for 
viable  and  amenable  concepts  under  which  long-term  objectives  can  be  achieved. 
During  this  short  period,  some  significant  progress  has  been  made.  -4 

To  be  sure,  the  fault  analysis  of  analog  circuit  is  an  uncharted  area  of 
research.  There  is  no  precedence  with  which  we  can  follow.  Indeed,  this  has 
been  an  exciting,  enjoyable  and  satisfying  research.  We  continuously  discover 
new  problems  which,  usually  dictates  the  requirements  of  new  methods.  As 
such,  new  directions  of  research  have  been  continuously  searched  and  persued. 
In  the  end,  we  believe  we  have  a  very  practical  and  reliable  method  at  hand. 
Some  highlights  of  this  development  will  be  presented  In  Sectin  IV.  |*In  the 
meantime,  twenty-two  publications  have  been  published,  one  Ph.D  dissertation 
and  three  Master  theses  have  been  completed,  and  a  final  report  of  a  Workshop 
on  Analog  Automatic  Test  Program  Generation  has  been  completed. 

Recently,  Dr.  Ruey-wen  Liu  was  invited  to  give  an  hour-long  tutorial 
address  to  the  1983  IEEE  International  Symposium  on  Circuits  and  Systems  in 
May,  1983.  This  was  a  rare  honor  for  the  Principal  Investigator.  It  also 
shows  that  the  general  academic  public  has  begun  to  show  interest  in  this 
exciting  area  of  research. 
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The  rest  of  the  Final  Report  is  organized  in  this  way.  The  background  of 
the  analog  analysis  is  presented  in  Section  II.  Some  important  issues  are 
given  in  Section  III.  Some  highlights  of  our  research  activities  will  be  dis¬ 
cussed  in  Section  IV.  A  conclusion  is  given  in  Section  V.  Finally,  a  record 
of  publications  is  in  Section  VI,  and  selected  reprints  are  attached  in  the 
Appendix. 
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II.  THE  BACKGROUND 


During  the  past  quarter  century,  the  engineering  community  has  witnessed 
a  tremendous  strides  in  the  art  of  electronics  design.  On  the  contrary,  elec¬ 
tronics  maintenance  has  changed  little  since  the  day  of  the  vacuum  tube.  As 
such,  our  ability  to  design  a  complex  electronic  circuit  is  quickly  out¬ 
distancing  our  ability  to  maintain  it.  In  turn,  the  price  reductions  which 
have  accompanied  modern  electronics  technology  have  been  paralleled  by  in¬ 
creasing  maintenance  and  operation  costs.  Indeed,  many  industries  are  finding 
that  the  life  cycle  maintenance  costs  for  their  electronic  equipment  now  ex¬ 
ceeds  their  original  capitol  investment. 

Given  the  above,  it  is  quickly  becoming  apparent  that  the  electronics 
maintenance  process,  like  the  design  process,  must  be  automated.  Unfortunate¬ 
ly,  the  50  years  of  progress  in  circuit  theory,  on  which  our  electronics  de¬ 
sign  automation  has  been  predicated,  does  not  exist  in  the  maintenance  area. 

As  such,  the  past  decade  has  witnessed  the  inauguration  of  a  basic  research 
program  to  lay  the  foundations  for  a  theory  of  electronics  maintenance  and  a 
parallel  effort  to  develop  operational  electronic  maintenance  codes. 

Thus  far  the  greatest  success  has  been  achieved  in  the  digital  electron¬ 
ics  area  to  the  point  that  commercialized  test  programs  are  now  readily  avail¬ 
able.  On  the  other  hand,  the  analog  testing  is  still  in  its  infancy.  This  is 
not  without  reasons. 

For  one  reason,  the  analog  fault  diagnosis  had  a  late  start.  The  re¬ 
search  and  theory  development  of  digital  testing  started  in  the  mid  1960's 
when  the  large-scale  computers  were  readily  available.  Not  until  a  decade 
later  did  a  commercialized  test  program  first  become  available.  On  the  other 
hand,  it  was  not  until  the  mid  1970's  that  the  test  technology  community  began 
to  face  up  to  the  analog  test  problem.  Indeed,  even  in  a  predominantly 
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digital  world,  analog  systems  were  not  disappearing.  Analog  systems  were 
proving  to  be  among  the  most  unreliable  and  least  readily  tested  of  all  elec¬ 
tronic  systems.  Assuming  the  same  speed  for  the  development  of  digital  test¬ 
ing,  a  commercially  available  analog  testing  program  would  not  have  been  ready 
until  the  mid  1980's. 

There  are  two  main  reasons  for  this  seemingly  slow  development  of  an 
effective  method  for  analog  fault  diagnosis.  One  reason  is  that  this  is  an 
uncharted  area  of  research.  It  has  no  precedence  to  follow.  More  time  has 
been  spent  to  find  what  the  real  problems  are  and  where  the  difficulty  is. 
Another  reason  is  that  analog  fault  diagnosis  has  inherited  certain  difficult 
problems  which  are  not  shared  by  digit  fault  diagnosis.  These  will  be  ex¬ 
plained  later. 


III.  IMPORTANT  ISSUES  OF  ANALOG  FAULT  DIAGNOSIS 
From  many  experiences  accumulated  in  the  past  years,  successful  ones  as 
well  as  unsuccessful  ones,  we  can  determine  two  major  issues  for  analog  fault 
diagnosis,  not  shared  by  its  counter  part,  the  digital  fault  diagnosis.  The 
two  major  issues  are  the  tolerance  problem  and  the  problem  of  modeling  and 
simulation  of  faulty  components: 

•  Tolerance:  Possibly  the  single  greatest  unknown  in  the  design  of  an  ana¬ 
log  testing  program  is  the  effect  of  the  tolerances  of  the  "good"  compo¬ 
nent  on  the  performance  of  a  testing  program.  This  tolerance  problem  has 
absolutely  no  counterpart  in  the  digital  testing  problem.  The  effect  of 
these  tolerances  can  completely  dominate  the  performance  of  a  testing  pro¬ 
gram.  In  an  analog  circuit,  unlike  digital  circuits,  the  actual  values  of 
circuit  parameters  almost  always  deviates  from  the  nominal  values.  There¬ 
fore,  any  analog  testing  program  has  to  face  up  to  the  problem  of  toler¬ 
ance  problem. 

•  Modeling  and  Simulation  of  Faulty  Components:  Unlike  the  digital  testing, 
a  complete  modeling  (and  thus  simulation)  of  faulty  components  is  not 
available  for  the  development  of  a  testing  problem.  The  modes  of  faulting 
is  too  many  to  encounter.  For  example,  a  faulty  resistor  may  have  an 
infinite  number  of  possible  resistances  (outside  of  the  tolerance).  In 
fact,  it  can  even  be  nonlinear.  A  faulty  capacitor  may  have  a  model  of 
parallel  RC.  A  faulty  operational  amplifier  may  have  a  model  of  22  tran¬ 
sistors  12  resistors  and  a  capacitor!  A  good  transistor  may  behave  like  a 
faulty  one  if  its  bias  is  switched  due  to  a  fault  which  occurred  else¬ 
where!  In  fact,  in  a  nonlinear  analog  environment,  we  are  still  in  the 
process  of  developing  viable  CAD  models  for  nominal  devices,  let  alone  for 
faulty  devices.  As  such,  a  thorough  test  of  the  performance  of  a  testing 
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program  is  impossible.  Furthermore,  each  testing  program  has  to  be  de¬ 
signed  based  soley  on  the  nominal  values  of  the  circuit. 

After  discussions  with  potential  users,  we  also  find  that  there  are  three 
important  measures  for  the  effectiveness  of  a  testing  program. 

•  Test  Points.  Due  to  the  practical  restriction  that  there  are  usually  only 
a  few  nodes  accessible  for  measurement  and  testing,  the  number  of  required 
test  points  has  to  be  as  small  as  possible . 

•  Post-Fault  Computation.  Since  the  post-fault  computation  is  directly  re¬ 
lated  to  the  per  unit  cost,  it  is  important  to  keep  the  post-fault  compu¬ 
tation  time  short  and  simple. 

•  Robustness.  This  issue  has  been  raised  many  times;  only  because  it  is  in¬ 
deed  the  dominant  issue  at  hand.  We  want  the  testing  program  to  be  relia¬ 
ble  when  the  fault/tolerance  ratio  is  small. 


These  two  major  issues  and  there  effective  measures  have  guided  our  research 
in  the  past  years. 


IV.  HIGHLIGHTS  OF  RESEARCH  EFFORTS 


The  fault  analysis  of  analog  circiut,  to  be  sure,  is  an  uncharted  area  of 
research.  There  is  no  precedence  with  which  we  can  follow.  Indeed,  this  has 
been  an  exciting,  enjoyable  and  satisfying  research.  We  continuously  discover 
new  problems  which  usually  dictates  the  requirements  of  new  methods.  As  such, 
new  directions  of  research  have  been  continuously  searched  and  pursued.  In 
the  end,  we  believe  we  have  a  very  practical  and  reliable  method  at  hand. 

Some  highlights  of  this  development  will  be  presented  here. 

The  initial  phase  of  our  research  was  placed  on  understanding  the  prob¬ 
lem.  One  of  the  important  problems  we  have  learned  in  the  period  was  the 
trade-off  problem  between  the  number  of  text  points  and  the  complexity  of 
fault  diagnosis  computation.  It  is  known  that  when  all  the  nodes  can  be  used 
as  test  points,  the  fault  diagnosis  equation  to  be  solved  is  linear  and  the 
computation  is  relatively  simple.  However,  when  the  number  of  test  points  are 
reduced  then  the  fault  diagnosis  equation  becomes  more  and  more  nonlinear. 
Solving  these  simultaneous  nonlinear  equations  involves  complex  computation 
and  the  results  are  less  reliable.  Under  this  constraint,  we  have  attempted 
to  pick  the  "best"  trade-off  point.  We  have  found  that  if  the  number  of  test 
points  are  reduced  at  particular  locations,  the  fault  diagnosis  equations  be¬ 
come  a  set  of  "nice"  nonlinear  equations,  i.e.,  the  sequentially  linear  equa¬ 
tions.  This  set  of  nonlinear  equations  has  a  nice  property  that  can  be  solved 
iteratively  by  a  linear  equation.  Therefore,  we  can  reduce  the  number  of  test 
points  but  not  increase  the  complexity  of  computation.  This  part  of  research 
was  a  successful  one,  see  [1]  and  [2],  but  the  result  was  not  good  enough  to 
have  practical  use.  It  still  requires  more  than  practically  allowed  test 
points  to  achieve  sequentially-linear  diagnosable  circuits.  As  such,  we  have 
to  look  for  new  directions. 


In  Che  next  phase,  many  ideas  have  been  Cried  and  CesCed:  Che  modular 
approach  [3],  Che  accessibilicy  approach  [5],  a  comparison  study  of  frequency- 
domain  approach  and  Cime-domain  approach  [8]  and  Che  fualC  directory  approach 
[12].  We  have  also  invesCigaCed  Che  escimacion,  Cheory  aspect  [9,10,11],  the 
computational  aspect  [4]  and  Che  feedback  system  design  aspect  [6]  of  the 
problem.  All  these  are  novel  and  useful  but  only  for  a  particular  situation. 
We  are  still  left  with  the  lack  of  a  central  thought  or  framework  under  which 
all  these  approach  can  be  applied.  This  leads  to  the  third  and  the  final 
phase  of  our  research. 

At  this  point,  we  begin  to  realize  that  we  have  been  trying  to  solve  an 
impossible  problem,  i.e.,  the  isolation  of  all  possible  combinations  of  faults 
with  a  limited  few  test  points.  With  limited  test  points  available,  we  can 
isolate  all  possible  faults  only  in  theory,  but  not  in  practice.  Because  it 
is  theoretically  possible,  it  kept  us  on  the  wrong  track  for  a  long  time.  The 
difficulty  in  the  implementation  of  such  a  theory  is  that  the  theory  is  based 
on  the  ideal  case,  i.e.,  no  tolerance  for  non-fault  elements.  This  tolerance 
effect  forces  us  to  look  for  new  directions  again. 

'’ith  a  limited  few  test  points,  our  goal  is  to  isolate  only  a  limited 
combination  of  faults.  We  look  for  only  the  cases  when  numbers  of  faulty  com¬ 
ponents  are  limited  by  a  few,  say  k.  This  is  the  k-fault  diagnosis  problem. 

By  turning  our  direction  in  this  way,  we  recognize  that  we  will  miss  some 
situations,  such  as  when  the  number  of  fault  elements  are  greater  than  k. 
However,  the  k-fault  diagnosis  problem  makes  a  lot  of  sense.  First,  a  system 
is  faulty  usually  because  only  a  few  of  its  components  are  faulty.  A  large 
number  of  components  become  faulty  at  the  same  time  is  a  rare  occurrence. 
Therefore,  we  do  not  miss  too  much  by  considering  the  k-fault  diagnosis  prob¬ 
lem.  Secondly,  the  k-fault  diagnosis  problem  was  found  to  be  mathematically 


tractable  and  computationally  simple.  Finally,  by  doing  so,  we  do  no  less 
than  our  counterpart,  the  fault  diagnosis  of  digital  systems.  In  our  counter¬ 
part,  most  of  their  successful  ones  are  only  for  single-fault  diagnosis,  i.e., 
k*l.  In  the  last  two  years,  our  attention  has  soley  been  devoted  to  the  k- 
fault  diagnosis  problem.  Fortunately,  we  have  been  very  successful  and  had 
some  breakthrough  results  [14,16,17,18,19,20,21,22]. 

At  this  stage,  we  have  only  considered  the  linear  circuits.  The  cases  of 
nonlinear  circuits  and  analog/digital  hybrid  circuits  are  left  for  future 
research.  In  the  case  of  linear  circuits,  the  following  has  been  achieved: 

1.  The  k-faulty  diagnosability  problem  is  completely  solved. 

2.  A  method  for  the  design  of  the  location  of  the  test  points  so  that  a 
circuit  becomes  k-fault  diagnosable  is  available. 

3.  A  simple  and  robust  computational  method  is  developed  to  implement 
the  above  theories. 

4.  The  number  of  test  points  required  is  small  even  for  large  and 
complex  circuits. 

As  a  final  test  to  our  porogram,  we  have  tested  our  method  on  a  circuit 
jointly  supplied  by  the  NAVAIR  and  the  Naval  Air  Engineering  Center.  This 
circuit  is  a  video  amplifier  which  consisted  of  21  nodes  and  38  components,  8 
of  which  were  transistors.  Even  for  digital  circuits,  38  components  required 
7  test  points.  We  needed  only  5  test  points  for  our  analog  circuit.  It  is 
well  known  that  the  diagnosis  of  analog  circuits  is  a  much  harder  problem  than 
that  of  digital  circuits.  This  demonstrates  that  our  method  required  very  few 
test  points. 

In  the  meantime,  for  the  same  NAVY  circuit,  the  computation  time  for  each 
fault-isolation  test  was  less  than  10  seconds  on  the  IBM37C.  This  is  amazing, 
for  it  translates  to  less  than  one  dollar  per  unit  cost! 


Finally,  our  method  has  been  demonstrated  to  be  reliable.  The  fault 
deviation  to  the  tolerance  deviation  ratio  is  about  200%  to  5%.  Although  the 
ratio  may  be  a  little  high,  this  is  the  first  method  which  was  shown  to  have 
the  capability  of  taking  any  tolerance  at  all. 


V.  CONCLUSIONS 


In  summary,  we  started  with  an  uncharted  area  of  research.  During  the 
four  years,  the  research  experiences  have  sometimes  been  painful,  hut  always 
exciting  and  rewarding.  Many  directions  have  been  tried  and  failed.  But  each 
time  we  have  learned  something  new.  Finally,  a  practical,  robust,  and  yet  ele¬ 
gant  method  is  at  hand.  This  method  has  been  tested  by  a  circuit  supplied  by 
the  NAVAIR  and  the  Naval  Air  Engineering  Center.  The  result  is  satisfactory 
and  successful. 

The  problem  of  fault  diagnosis  began  to  attract  academic  interest.  It  has 
always  been  that  the  problem  of  fault  diagnosis  is  considered  at*,  a  problem  be¬ 
longing  to  repairing  shops.  This  point  of  view  has  been  changed.  There  have 
been  special  sessions  on  analog  fault  diagnosis  in  the  IEEE  International  Sym¬ 
posium  on  Circuits  and  Systems  in  the  past  two  years  and  definitely  will  also 
be  in  the  next  two  years.  The  principle  investigator  has  been  invited  to  give 
talks  on  analog  fault  diagnosis  problems  at  the  Allerton  Conference  on  Communi¬ 
cation,  Control  and  Computing  f3],  the  National  Electronic  Conference  f 7 1 ,  the 
IEEE  Conference  on  Decision  and  Control  (14],  the  IEEE  International  Symposium 
on  Large  Scale  Systems  [16]  and  many  times  at  the  IEEE  International  Symposium 
on  Circuits  and  Systems  [5,12,18,19].  The  last  talk  [19]  was  an  hour-long  tu¬ 
torial  speech.  He  was  also  invited  to  submit  a  paper  by  the  Journal  of  Society 
of  Instrument  and  Control  Engineers  of  JAPAN  [15]. 

Up  to  this  point,  we  have  just  completed  the  first  phase  of  our  research, 
the  deterministic  model  approach.  Next,  we  will  take  the  stochastic  model 
approach ,  and  finally,  the  artificial  intelligence  model  approach. 

The  stochastic  model  will  incorporate  the  uncertainties,  such  as  the  tol¬ 
erance  into  the  model.  We  have  started  and  nearly  completed  this  part  of  the 
research.  Though  this  part  of  the  research  was  carried  out  after  the  termina¬ 
tion  date  of  the  ONR  contract,  it  may  be  relevant  to  report  that  some  break¬ 
through  results  have  been  obtained.  Theoretically,  we  achieved  the  Cramer-Rao 
lower  bound,  and  hence,  we  have  obtained  the  most  efficient  algorithm.  Practi¬ 
cally,  it  recovered  all  the  faults  that  were  missed  by  using  the  deterministic 
model  approach  before.  With  these  encouraging  results,  we  are  very  optimistic 
that  the  problem  of  analog  fault  diagnosis  will  be  completely  understood  in  the 
very  near  future. 
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Sequentially  Linear  Fault  Diagnosis: 

Part  I — Theory 

RUEY-WEN  LIU,  MEMBER,  IEEE,  AND  V.  VISVANATHAN 


Attract— A  good  mIHm  to  the  tradeoff  probim  between  the  cost  of 
cofttdoa  sad  the  cost  of  teet  potato  le  the  icqucuriaBy  Uaearty  dtegaoe- 
■Me  sysesms.  CeaffHous  eader  wMch  a  sjrsteat  is  seqa endaMy  Uaearty 
dhgaoaaMe  era  developed  la  Pert  L  A  deaiga  procedure  for  the  teet  poiats 
to  MM  these  coaffdeoe  is  given  he  Part  O. 

I.  Introduction 

N  fault  diagnosis  of  analog  systems  we  encounter  two 
classes  of  faults — catastrophic  faults  and  soft  faults 
[17].  The  former  occurs  due  to  an  extreme  change  in  the 
performance  of  some  components)  (a  fuse  blows  or  a 
circuit  shorts  out)  and  results  in  outright  failure  of  system 
performance.  Diagnosis  of  such  faults  [1]  is  usually  based 
on  the  acceptable  premise  that  the  system  failure  is  due  to 
a  catastrophic  fault  in  one  (or  a  few)  of  its  components. 
On  the  other  hand,  when  the  values  of  the  parameters  of 
the  system  drift  there  is  a  degradation  of  system  perfor¬ 
mance  which  is  called  a  soft  fault  [17].  The  drift  in 
parameter  values  is  usually  caused  by  permanent  over¬ 
stress  (high  temperature,  continued  overload  operation, 
material  stress,  etc.)  or  aging.  In  such  cases  one  has  to 
make  the  decision  that  the  system  performance  has 
sufficiently  degraded  for  the  system  to  be  declared  faulty, 
and  the  problem  of  diagnosing  the  fault  is  more  difficult 
than  in  the  catastrophic  case.  This  is  because  the  faulty 
condition  of  the  system  may  be  caused  by  changes  in  the 
values  of  many  parameters.  However,  if  we  can  determine 
all  the  parameter  values,  not  only  can  we  decide  if  a  soft 
fault  has  occured,  but  also  locate  the  faulty  parameters. 
This  approach  will  be  used  in  this  paper. 

In  this  paper  we  consider  soft  faults  in  linear  large-scale 
dynamical  systems  (LSDS),  i.e.,  an  interconnected  system 
whose  components  are  linear  and  time  invariant.  When 
we  study  such  systems  from  the  point  of  view  of  fault 
diagnosis,  we  consider  it  to  have  two  parts — the  system 
parameters  and  the  system  structure.  The  system  parame¬ 
ters  are  real-valued  variables.  The  system  structure  con¬ 
sists  of  all  the  parts  of  the  system  that  are  known  (or 
assumed)  to  be  invariant.  This  may  include  the  location  of 


the  inputs  and  outputs,  the  connections  between  the  com¬ 
ponents  and  the  dynamical  nature  of  the  components. 

Let  the  vector  p  denote  a  subset  of  the  system  parame¬ 
ters  which  is  to  be  diagnosed.  Its  relation  to  the  transfer 
function  is  given  by 

H(s)~h(s,p)  (1) 

where  the  function  h  is  determined  by  the  system  struc¬ 
ture  and  the  nominal  values  of  all  the  parameters  not 
included  in  the  subset  p,  i.e.,  the  parameters  which  are 
assumed  to  be  invariant  or  fault  free.  Since  from  q  input 
and  m  outputs  of  any  linear  time-invariant  system  we  can 
only  determine  the  mXq  transfer  function  matrix  H(s), 
we  have  the  following  definition. 

Definition  1 

The  parameters  p  of  an  LSDS  are  said  lo  be  diagnosable 
if  p  can  be  determined  from  H(s),  i.e.,  h  is  injective. 

Note  that  given  an  LSDS  the  function  h  may  vary 
according  to  which  subset  of  the  parameters  of  the  LSDS 
are  to  be  diagnosed,  and,  therefore,  one  subset  of  parame¬ 
ters  may  be  diagnosable  while  some  other  subset  of  the 
same  LSDS  may  not. 

Since  the  function  h  maps  p  into  the  space  of  symbolic 
transfer  function  matrices  determining  conditions  under 
which  it  is  injective  is  a  difficult  problem.  We  propose 
instead  an  approach  based  on  the  time-domain  repre¬ 
sentation  of  the  transfer  function  matrix  H(s).  This  ap¬ 
proach  is  based  on  the  following  assertion. 

Assertion  l  [2] 

Two  minimal  (controllable  and  observable)  linear  time- 
invariant  state  equation  representations  61  —  [A,ff,C,D] 
and  61 — [A, B,C,D]  with  state  spaces  of  the  same  dimen¬ 
sion  n  are  realizations  of  the  same  transfer  function 
matrix  H(s)  if  and  only  if 

A 

D-D 

and 

CA ‘B -  CA'B,  I -0,  i  -  •  •  In- 1. 


y  u 


to  l. 
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The  Markov  parameters  D  and  CA  '8.  i — 0, 1  •  •  •  2n  -  1 
are  well-defined  functions  of  p.  We  can  therefore  write 

vec[D;CB;-;CA2"~'B]-F(p).  (2) 

Gearly  we  have  Assertion  2. 
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Assertion  2 

The  parameters  p  of  the  LSDS  are  diagnosable  if  and 
only  if  F  is  injective. 

Note  that  F  unlike  A  is  a  real-valued  function  of  real 
variables. 

Assertion  2  requires  that  F(-)  be  injective  for  the 
parameters  p  to  be  diagnosable.  This  is  a  very  strong 
requirement.  A  less  strict  but  equally  useful  condition  is 
that  F(  • )  be  injective  in  a  generic  sense. 1 

Definition  2 

A  function  /:  is  injective  in  a  generic  sense  if 

V  -  { x  €  R"|3jc'  €  fr,  x  *x'  and  fix)  -  fix') } 
is  a  proper  variety. 


In  this  paper,  we  propose  a  solution  to  the  tradeoff 
problem.  It  is  found  that  under  certain  conditions,  the 
nonlinear  equation  (2)  can  be  made  to  be  sequentially 
linear  (see  (31)),  i.e.,  a  set  of  nonlinear  equations  which 
can  be  solved  by  solving  a  set  of  linear  equations  in  a 
sequential  manner.  Therefore,  an  optimal  solution  to  the 
tradeoff  problem  becomes  the  least  number  of  test  points 
which  makes  (2)  sequentially  linear.  Any  additional  test 
points  to  the  optimal  solution  will  not  reduce  the  com¬ 
putation  cost  appreciably.  One  less  test  point  will  cause 
(2)  to  be  “genuinely”  nonlinear,  and,  therefore,  an  in¬ 
crease  of  computation  cost  by  an  appreciable  amount. 
The  two-part  paper  gives  a  partial  solution  to  the  tradeoff 
problem. 

II.  The  LSDS  Model  and  Partittonino 


Definition  3 

A  set  of  parameters  p  is  diagnosable  in  a  generic  sense  if 
F(  )  is  injective  in  a  generic  sense. 

Since  the  “probability”  that  the  parameters  will  take  on 
the  exact  values  of  the  points  on  the  proper  variety  equals 
zero,  generic  diagnosability  is  an  equally  good  property  as 
far  as  fault  diagnosis  is  concerned. 

It  is  not  an  easy  task  to  determine  a  necessary  and 
sufficient  condition  under  which  this  property  holds  for 
an  arbitrary  nonlinear  function.  At  this  stage  it  is  perti¬ 
nent  to  mention  that  if  the  Jacobian  of  the  function  F{  ) 
has  full  column  rank  in  a  generic  sense  it  is  not  necessary 
that  F(-)  is  injective  in  a  generic  sense.  We  will  illustrate 
this  point  with  an  example. 

Example  I:  Consider  the  function 

y(x)~x2 

whose  domain  is  the  set  of  real  numbers.  The  Jacobian  of 
the  function  is 

J fix)  •lx. 

J^,(x)-0  if  and  only  if  x  »0.  Hence,  Jfix)  has  full  column 
rank  in  a  generic  sense.  However,  the  set  V  of  Definition  2 
is,  in  this  case 

F-R-{0) 

which  is  not  a  proper  variety.  Hence,  y(x)  is  not  injective 
in  a  generic  sense. 

Let  us  consider  a  tradeoff  problem  between  the  cost  of 
computing  and  the  cost  of  test  points.  In  general,  (1)  or  (2) 
are  nonlinear  and  the  dimension  of  p  is  large.  Therefore, 
the  computation  cost  is  high  and  the  accuracy  is  low.  This 
situation  can  be  alleviated  by  an  increase  of  the  number 
of  test  terminals.  For  example,  if  every  parameter  can  be 
measured  directly,  the  function  /  or  F  becomes  linear  and 
decoupled.  On  the  other  hand,  every  additional  test  point 
is  associated  with  an  additional  cost,  and  there  is  a  limit 
of  numbers  of  test  points  to  be  added  to  a  circuit.  This  is  a 
major  tradeoff  problem  for  fault  diagnosis  problems. 

'This  concept  was  introduced  to  the  problem  of  fault  diagnosis  by  Sen 
and  Seeks  (3J.  See  (10)  or  the  Appendix  for  the  definition  of  genericity. 


The  LSDS  is  assumed  to  be  an  interconnection  of  single 
input-single  output  (SISO)  components.  The  ith  compo¬ 
nent  may  be  represented  as 

»&(•)  (3) 

where  g,(s)  is  a  known  transfer  function  and  p,  the  param¬ 
eter  to  be  diagnosed,  and  it  has  a  state  equation  repre¬ 
sentation  (4),  where  x,  is  the  state  vector  and  a,  and  6,  are, 
respectively,  the  scalar  input  and  output  of  the  compo¬ 
nent. 

j,  «<£,?,  +  $,  P.a, 

b,  —  G,  j,  +  6i),  p,a,..  (4) 

The  LSDS  is  described  by  the  component-connection 
model  (5)  [5j,  [7],  (18], 

x^Ax  +  BPa 

b  —  Cx  +  DPa  (5a) 


al»fL°*  Lau\\b 


(5b) 


where  4-diag  (G,)  and  B ,  C,  D,  and  P  are  similarly 
defined.  The  vector  x-vec  (;,)  and  a  and  b  are  similarly 
defined.  Equation  (5b)  describes  the  connection  between 
the  components  and  the  input-output  terminals  u  and  y 
of  the  system.  Without  loss  of  generality  it  may  be 
assumed  that  both  the  numerator  and  denominator  of 
g,(s)  are  monic  polynomials  [14]. 

It  has  been  shown  [  4]  that  without  loss  of  generality  the 
LSDS  description  (5)  with  each  component  being  a  SISO 
system  applies  to  LSDS  where  some  of  the  components 
may  be  MIMO  with  a  transfer  function  as  follows: 


Ai&i(j)  •••  Pijgfis) 


By  redefining  the  connections,  each  one  of  the  above  ij 
scalar  transfer  functions  becomes  an  SISO  component. 

We  now  introduce  a  partitioning  of  the  components 
and  rewrite  (5)  based  on  it.  The  partitioning  leads  natur¬ 
ally  to  the  fault  diagnosis  equations  developed  in  the  next 
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section.  Consider  the  SISO  component  described  by  equa¬ 
tions  (3)  or  (4).  Let 

m,  -  (degree  of  the  denominator  polynomial  of 
&(*))- (degree  of  the  numerator  polynomial  of 
&(■*))•  (6) 

Definition  4 

The  quantity  /n,  defined  in  (6)  is  called  the  minimum 
delay  order  of  the  ith  component  and  the  component  is 
called  an  /^-minimum  delay  component. 

Partition  the  components  of  the  LSDS  into  the  classes 
Sq,  S,,-  •  •  Sj,-  •  •  SK  in  the  following  manner.  The  class  Sj 
consists  of  all  the  y-minimum  delay  components  of  the 
LSDS.  The  largest  minimum  delay  order  of  all  the  compo¬ 
nents  in  the  LSDS  is  AT.  On  the  basis  of  this  partitioning 
we  can  rewrite  (5)  as  in  equation  (7).  The  partitioning  of 
the  vectors  a,  b,  and  x  and  the  matrices  A,  B,  C,  D,  P, 
and  L  is  conformable  with  the  partitioning  into  the  classes 
S0  -  •  ■  SK.  For  example,  the  matrix  Pt  is  a  diagonal  matrix 
whose  diagonal  entries  are  the  parameters  of  the  compo¬ 
nents  that  belong  to  the  class  S’,  and  L n  consists  of  the 
gain*  of  the  connections  directed  to  the  components  in 
class  S3  from  those  in  class  S2. 


that  for  values  of  s  whose  magnitude  is  sufficiently  large 


I 


e,ai% 


/+< 


-  j-m  S'* 

It  then  follows  that  for  the  LSDS  description  (7) 
I,Dl~0,ClBl~  I 

and 

A-0 


(8) 


CjA'jBj — 0 

QAr'B,-!, 


j-0,--,i-2,i-2,--,K.  (9) 


Therefore, 

DP ■  diag  (Djl>o,D|l> i  ■ '  ■  ^jc^r)“diag  (Pq,0,-  •  •  ,0). 

(10) 

As  a  consequence,  det  [I-L^DP]** det  [I-L^Pq]  and 
by  Theorem  1  of  Singh  and  Liu  [7]  the  LSDS  (7)  has  a 
state  equation  representation  in  the  composite  state  space 
of  all  its  components  if  and  only  if  det  [/—  Lgo/y^O. 


Assertion  3 

Det  [/-  LoqPqJ^O  in  a  generic  sense. 

Proof:  Det  [I—  L^P0]  is  a  polynomial  in  the  entries  of 
the  matrix  which  are  themselves  polynomials  in  the  vari- 


*0 

Ao 

*0 

BoTo 

oo 

0 

0 

*1 

•^i 

X\ 

B,P, 

a, 

— 

. 

+ 

• 

0 

• 

■ 

0 

• 

*jr 

- 1 

X K 

BkPk 

aK 

- 1 

o 

C0 

*0 

DqP o 

a 0 

0 

0 

bx 

<7, 

x\ 

f>,P, 

«* 

+ 

• 

0 

• 

0 

• 

1 - 

4^ 

i 

u* 

XK 

Dk? k 

aK 

<*0 

J 

Lo k  j 

L*  ' 

b0 

*1 

^10  . 

L\k  i 

Lxu 

bj 

• 

- 

• 

• 

• 

<*K 

L/C0  . 

1 

7- ATX  1 

bK 

-y- 

>»  . 

LyK  | 

Iyu 

u 

(7a) 


(7b) 


Let  p  €  R",  p0  S  R"°,  p ,  e  R"'  •  •  •  pK  6  R**  be,  respectively, 
the  diagonal  entries  of  the  matrices  P,  P0,  P,  •  •  •  PK.  Note 
that  n-  «o+  nt  +  ■■■  nK.  Since  gt(s)  is  monic  if  it  is  part  of 
an  m-minimum  delay  component,  it  can  be  easily  shown 


ables  p0.  Hence,  the  equation 

det[/-LooP0]-0  (11) 

can  be  written  as 
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02) 

where  q(-,  •„■)  is  a  polynomial  and  hence,  a  variety  K,  in 
R“*.  Since 

det  [/]*0 

the  point  p->0  does  not  lie  on  V  and  hence,  V  is  a  proper 
variety.  The  proof  is  completed. 

Due  to  Assertion  3  in  what  follows,  we  consider  only 
those  fault  conditions  for  which  det  [I  -  LggPg^Q.  Let, 

A0  ^/<o+  BoP0(I- Lvf 0)- '  LooCo 
B0*  BoPoV-LnPo)-1 

C0  -  Co  +  PoV-LooPor'LooCo 
Po^PoU-LooPo)-'.  (13) 

Due  to  (10)  and  (13),  (7)  is  equivalent  to  equation  (14). 

x  —  Ax  +  Ba 

b**Cx  +  Da  (14a) 

i  r  a  IT” 

a  L  i  L  b 

~  7  -  (14b) 

where 

,4 -diag  (Aq,Av---,Ak) 

B~ diag  ( B0,BiP j,  •  •  •  ,BKPK) 

C  —  diag  ( Co,  C] ,  •  ■  • ,  CK ) 
D-diag(Po,0,--,0)  (15) 

and 

0  Coi  ’ "  CoK 

I  Lw  Li I  •  •  •  Ltjr 


Note  that 


-xo  Mn 


DL^D~0. 


III.  Conditions  for  Generic  Diagnosabiuty 

Let  5(r)  represent  the  matrix  transfer  function  of  the 
USDS  described  by  (14)  and  Z(s)  the  matrix  transfer 
function  of  the  components  described  by  (14a). 

S(s)«  4*Z(r)[/- 4*Z(j)]  ' -'L„  + 1*,.  (18) 

With  a  power  series  expansion  of  Z(s)  and  [/- LabZ{s))~ 1 
which  are  valid  for  values  of  s  whose  magnitude  is 
sufficiently  large,  we  have 


09) 


Since  S(s )  has  a  state  equation  representation, 

S(s)-  |  MJs'  (20) 

i-O 

where  M'%  are  the  Markov  parameters  of  S(s).  We  want 
to  equate  the  coefficients  of  (19)  and  (20). 

It  follows  from  (15)  and  (9)  that 

CAJ8  -  diag  (m^.m^,-  •  •  ,m^,Py+„0,-  •  •  ,0) 

where 

My  “  CoAiBo 

nty^  CjAfBjPj,  1  </<_/. 

Note  that  m^  is  a  function  of  only  Pt.  Equating  the 
coefficients  of  (19)  and  (20),  we  arrive  at  Lemma  1. 

Lemma  1 

Af,-$,(/V  •  •  ,/*,)  where  i» 0, 1,2,-  •  • 

Imposing  (17),  the  explicit  expressions  of  <f>,  can  be 
obtained: 

VoAv  +  4-  (21a) 

“  [  4»  CyoPo^Oi  ]  •^>i[  Cju  +  LjoLo*  ] 

+/(P0,---,P<_I),  i-l,-*,AT.  (21b) 

The  above  equations  indicates  the  sequential  nature  of  the 
fault  diagnosis  equations.  Note  from  (21b)  that  for  each  i, 
<t>i  is  a  linear  function  with  respect  to  Pt. 

Recall  that  are  the  diagonal  entrices  of  Pt. 

Taking  the  vec  of  equation  (21)  and  the  appropriate  dot 
product  (8),  we  have 

vec  [  M0  -  4,]  —  vec  [  -  Ufa)  (22a) 

vec  [M-f(Po,- •  •  ./»,_,>]  -  /- 1,2,-  •  •  ,AT. 

(22b) 

where 

N(-[Lh,  +  L((>P04H(]7'O[4fc  +  Z>o/()4»]  (23) 

is  a  function  of  P0  only. 

Note  that  the  existence  of  the  solution  to  (22)  is 
guaranteed.  If  \p0  is  injective,  and  if  has  full  column 
rank,  the  solution  is  unique.  Since  the  column  rank  of  Ni 
depends  on  the  value  of  Pg,  the  post-fault  values  of  system 
parameters,  it  does  not  shed  a  light  in  the  design  of 
diagnosable  systems.  As  such,  we  will  treat  the  problem  in 
a  generic  sense. 

Lemma  2 

The  union  of  a  finite  number  of  proper  varieties  is  a 
proper  variety. 

Proof:  Consider  the  k]  +  k2  -  ■  ■  +  km  polynomials  in  n 
indeterminates  with  coefficients  in  R: 

9 !i’9i2’- ' '  *9i*, 

921*922’ '  ‘  '  *92*, 

9ml’ 9m2*'  '  '  ’9mA,' 
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The  Boolean  variable  biJt  /—!,•••  ,m,  j—  1,-  •  •  ,k„  is  de¬ 
fined  as  follows: 

by-  0,  if  qy+0 

by- 1,  if  qy-0.  (24) 

It  follows  from  this  definition  that  the  proper  varieties 
VjC R",  /"l,-  •  •  ,m  described  as 

fy-0,  j—  l,-  *  ■  ,kj 

can  be  equivalently  described  by  the  Boolean  equations 
bi\babik,  -  A 
while  the  Boolean  equation 

by  +  br/-l  (25) 

is  equivalent  to  the  polynomial  equation 

qyq.y-0.  (26) 

The  union  of  the  proper  varieties  Vt,  i ■»  1,-  •  •  ,m,  can  now 
be  described  as 

bll'birblkl  +  b2l”-b2ki+  "^bml'bna'  '  ’  bmk„  “  !•  (27) 

We  can  now  express  this  sum  of  products  as  the  product 
of  stuns, 

44 . 4-1  (28) 

where  b,  is  a  sum  of  some  of  the  Boolean  variables  btj  of 
(27).  Since  (25)  is  equivalent  to  (26)  each  b,  associates  with 
a  polynomial  which  is  a  product  of  some  of  the  polynomi¬ 
als  qtJ.  Therefore,  (28)  represents  a  variety  V.  Clearly 
and  therefore  it  is  a  proper  variety.  The  proof  is 
completed.  Theorem  1  follows  directly  from  Lemma  2  and 
(22). 

Theorem  1 

The  parameters  p  of  the  LSDS  are  diagnosable  in  a 
generic  sense  if  Mpo)  is  injective  in  a  generic  sense  and 
the  matrices  N„  i.e., 

[LlK  +  Ll0P0L0u]rO[lyi  +  £>0P0La\,  i-\,2-K 

have  full  column  rank  in  a  generic  sense. 

Definition  5 

The  parameters  p  of  the  LSDS  are  said  to  be  quasi- 
sequentially  linearly  diagnosable  in  a  generic  sense  if  the 
conditions  of  Theorem  1  are  satisfied. 

Assertion  4 
The  matrix  N,,  i.e., 

[Lh,  +  Ll0P0(I-L(X>P0r'L0u]r 

o[4,+ivw-w\>r,4(] 

has  full  column  rank  in  a  generic  sense  if  and  only  if  it  is 
full  column  rank  for  some  p0—p$- 


Proof: 

Necessity:  The  proof  follows  from  the  definition  of  a 
generic  property. 

Sufficiency:  By  Assertion  3  there  exists  a  proper 
variety  V{  c  R"°  such  that  Vp0  £  Vx  the  matrix 

[lIu+li0p0(i-l1x>p0)-'l0u}t 

o[z>,+z.v>0(/-l00^o)'i4,] 

exists.  When  the  matrix  does  exist  it  does  not  have  full 
column  rank  if  and  only  if  all  its  nt  x  n,  minors  (since  there 
exists  a  p*  where  the  matrix  is  full  column  rank,  it  has  at 
least  n,  rows)  are  zero,  i.e.,  if  the  parameter  values  p0  are 
common  zeros  of  the  polynomials  in  the  numerator  of 
each  minor.  Let  V2  be  the  variety  so  defined.  Since  there 
exists  a  p*  where  the  matrix  is  full  column  rank,  V2  &  ft"0, 
hence  it  is  a  proper  variety.  By  Lemma  2, 

V  —  VXUV2 

is  a  proper  variety.  Since  V/>0£  V  the  matrix  is  defined 
and  is  full  column  rank,  the  proof  is  completed. 

Due  to  Assertion  4,  to  check  if  the  parameters  p  of  an 
LSDS  are  quasi-sequentially  linearly  diagnosable  in  a 
generic  sense  we  need  to  check  the  rank  of  the  matrices  at 
only  one  set  of  parameter  values  (say  the  nominal  values). 

Note  that  if  the  parameters  p  of  an  LSDS  is  quasi- 
sequentially  linearly  diagnosable  then  all  P's,  /  =—  1 , 2, -  -  - , 
can  be  solved  from  the  linear  equations  (22b),  in  a  sequen¬ 
tial  manner.  The  only  exception  is  p „,  which  has  to  be 
solved  from  (22a),  generally  a  nonlinear  equation.  In  the 
next  section  we  propose  a  canonical  form  of  the  LSDS  for 
which  (22a)  becomes  a  linear  equation  in  p0  and  as  a 
consequence  the  fault  diagnosis  equations  (22)  are  indeed 
sequentially  linear. 

We  have  so  far  assumed  that  all  the  parameters  of  the 
LSDS  need  to  be  diagnosed.  The  extension  of  Theorem  1 
to  the  case  where  only  a  subset  of  the  parameters  p  are  to 
be  diagnosed  is  straightforward.  Let  p'-Col  (/>£,•  •  •  ,p’K) 
be  a  vector  which  consists  of  the  parameters  to  be  di¬ 
agnosed.  Equation  (22a)  may  be  rewritten  as 

v-U  (Po)  (29) 

where  v  is  a  vector  determined  by  Mq,  and  the 
nominal  values  of  the  parameters  which  are  in  p0  but  not 
in  p’0.  We  have  Corollary  1  immediately. 

Corollary  1:  The  parameters  p’  of  the  LSDS  are  di¬ 
agnosable  in  a  generic  sense  if  the  function  ^-)  is  injec¬ 
tive  in  a  generic  sense  and  for  i—  1,-  •  •  ,K  the  submatrix 
of  the  matrix  Nif  i.e., 

[  Am  ■*"  LiqPq(1  —  LqqP 0)  1  Lfc  J 

O[z>1  +  Lv0P0(/-L00/>0)-,L0,.] 

consisting  only  of  the  columns  that  correspond  to  parame¬ 
ters  in  p[  is  full  column  rank  in  a  generic  sense. 
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Definition  6 

The  parameters  p'  of  the  LSDS  are  said  to  be  quasi- 
sequentially  linearly  diagnosable  in  a  generic  sense  if  the 
conditions  of  Corollary  1  are  satisfied. 

TV.  The  Canonical  LSDS 


Definition  7 

If  in  an  LSDS  (7)  the  matrix  is  diagonal  it  is  called 
a  Canonical  LSDS. 

Let  4*  and  Poi  represent  the  (ii)th  entry  of  the  matrices 
Lqo  and  P&  respectively,  of  a  Canonical  LSDS.  If  we  now 
redefine  as 


Pot 


£ 


Poi 

1 -  i<xPoi 


(30) 


the  fault  diagnosis  equations  (22)  for  the  Canonical  LSDS 
reduce  to 


vec[M0— (31a) 
vec  [M,-MPo,Pt,-  •  •  ,/i-,)]  -([LjB  +  ZwP0£1„1]r 

0[Z*  +  VVw]>..  *  —  1,2,- •  *  ,JC-  (31b) 


Note  that  (31a)  and  (31b)  are  sequentially  linear. 

The  major  problem  of  diagnosis  of  an  LSDS  is  with  the 
0-minimum  delay  components  since  it  requires  that  a 
nonlinear  function  be  injective  in  a  generic  sense.  On  the 
other  hand,  for  a  Canonical  LSDS  the  corresponding 
condition  reduces  to  that  of  the  matrix  L^QL^  being  full 
column  rank.  Therefore,  it  would  be  advantageous  if  an 
LSDS  can  be  reduced  to  a  Canonical  LSDS.  As  such,  an 
LSDS  can  be  made  sequentially  linearly  diagnosable.  A 
strategy  of  test-point  location  whereby  this  is  achieved  is 
given  in  the  second  half  of  this  two-part  paper. 

V.  Appendix 
GBNBRicmr 

Let 

Pm(PvP»’ ‘  - 'Pm)e  R" 

and  consider  polynomials  ip(Xt ,•  •  •  with  coefficients  in 

R.  A  variety  KcR"  is  defined  to  be  the  set  of  common 
zeros  of  a  finite  number  of  polynomials  ^ „•  •  •  ,\frk: 

Vm  { P •  *iQv  • '  .A,)-0,  i- 1,2,  •  ,*}. 

V  is  proper  if  K<i*R"  and  nontrivial  if  V+G5.  In  this  setup, 
a  property  w  is  merely  a  function  ir:  R"-»{0, 1 },  where 
*(/>)“  1  (or  0)  if  *  holds  (or  fails)  at  p.  Let  K  be  a  proper 
variety.  We  say  that  ir  is  generic  relative  to  V  provided 
«0>)«0  only  if  p  e  V;  and  that  ir  is  generic  provided  such 
a  V  exists. 

The  usefulness  of  this  concept  is  based  on  the  following 
facts.  Let  p0E  V,  where  V  is  nontrivial  and  proper.  If  v  is 
generic  relative  to  V  and  if  ir  fails  at  p^,  m  can  be  made  to 
hold  if  p0  is  shifted  by  a  suitable  perturbation,  arbitrarily 
small.  It  can  also  be  shown  that  such  a  V  has  zero 


“probability”  (Lebesgue  measure).  Therefore,  ir  holds  al¬ 
most  everywhere  in  R". 
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Sequentially  Linear  Fault  Diagnosis: 

Part  II— The  Design  of  Diagnosable  Systems 

V.  VISVANATHAN  and  RUEY-WEN  LIU,  member,  ieee 


Abstract — Baaed  oa  the  results  developed  In  Part  I  a  strategy  of  teat 
point  location  by  which  the  parameters  of  an  LSDS  are  amde  dtitgimnMr 
In  a  generic  sense,  la  developed.  By  appropriate  teat  point  placement,  an 
LSDS'  It  reduced  to  a  canonical  LSDS.  Next,  an  algorithm  for  the 
tynihaale  of  the  teat  points  required  to  make  the  parameters  of  the 
ranoniral  LSDS  sequentially  linearly  tUagnoaahle  in  a  generic  sense,  Is 
given. 

I.  Introduction 

YSTEM  designers  have  so  far  concentrated  on  the 
performance  specifications  of  the  systems  they  build 
and  have  given  little  thought  to  the  diagnosability  of  the 
end  product.  At  a  seminar  nearly  two  decades  ago,  the 
late  Prof.  Seshu  in  a  paper  about  the  future  of  diagnosis 
[9]  stated  that  there  would  be  increased  concern  for  di¬ 
agnosability  in  circuit  design  to  the  extent  that  “the  circuit 
designer  is  going  to  be  required  to  supply  the  diagnostic 
tests  for  the  circuit  he  designs.”  Recently,  studies  [3]— [6] 
have  been  made  on  the  tests  of  diagnosability  from  a 
given  set  of  test  points.  In  this  paper,  a  design  procedure 
for  the  test  points  is  given  so  that  a  given  circuit  is  linearly 
sequentially  diagnosable. 

We  first  present  a  strategy  of  test  point  location  which 
reduces  an  LSDS  to  a  canonical  LSDS.  We  then  discuss 
an  algorithm  to  determine  the  test  point  locations  required 
to  make  the  parameters  of  the  canonical  LSDS  sequen¬ 
tially  linearly  diagnosable  in  a  generic  sense.  We  approach 
the  problem  from  a  graph-theoretic  point  of  view.  We 
view  the  present  work  as  a  preliminary  result  in  a  field 
that  is  largely  unexplored. 

II.  Preliminaries 

We  begin  this  section  with  the  definition  of  a  test  point. 
Definition  1 

An  input  or  output  terminal  introduced  in  a  system  for 
the  sole  purpose  of  diagnosis  is  called  a  test  point. 

An  important  property  that  must  be  satisfied  by  all  test 
points  is  that  during  normal  operation  we  should  be  able 
to  put  them  in  a  state  in  which  they  do  not  affect  the 
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Block  Diagram  Digraph 


(C) 

Fig.  1.  Equivalent  representations. 

system.  The  test  points  then  come  into  the  picture  only 
when  fault  diagnosis  needs  to  be  done. 

The  component-connection  model  of  the  LSDS  has  a 
digraph  representation,  with  the  components,  input  sir,*’ 
generators  and  output  detectors  and  meters  being  ftodr-: 
and  the  connections  being  directed  edges.  The  equiva¬ 
lence  between  the  block-diagram  representation  of  the 
LSDS  [16,  eq.  (5)]  and  the  digraph  representation  is 
illustrated  in  Fig.  1.  In  the  rest  of  this  naper  the  terms 
system  node,  input  node,  output  node,  and  edge  will  be 
used  interchangeably  with  component,  input  terminal, 
output  terminal,  and  connection,  respectively. 

III.  A  Decomposition  Strategy 

A  major  problem  of  diagnosis  is  with  the  0-minimum 
delay  components  since  it  requires  that  a  nonlinear  equa¬ 
tion  be  injective  in  a  generic  sense.  As  we  mentioned  in 
[16]  it  is  not  an  easy  task  to  determine  the  necessary  and 
sufficient  conditions  under  which  this  property  exists.  A 
simple  sufficient  condition  can  be  derived  by  considering 
[16,  eq.  (21a)].  It  follows  from  this  equation  that 

vec^o-^J-fz^®^]  vec^o]  (1) 

where  ®  denotes  jhe  Kronecker  cross  product  [11].  From 
the  definition  of  P0  [16,  eq.  (13)J  it  follows  that 

Po-U  +  PvLny'Po.  (2) 

If  the  matrix  [  Lqu  ®  iy0)  has  full  column  rank  then  equa¬ 
tions  (1)  and  (2)  can  be  solved  for  a  unique  P0.  However, 
for  the  matrix  [  L£  ®  L^]  to  be  full  column  rank  it  is 
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Fig.  2.  Decomposition. 

necessary  that  the  number  of  both  the  input  terminals  and 
output  terminals  be  greater  than  the  number  of  0-mini- 
mum  delay  components.  We  propose  instead  a  strategy  of 
decomposition  which  in  most  cases  will  require  a  fewer 
number  of  test  points. 

Consider  the  subgraph  of  the  LSDS  which  consists  of 
the  O-minimum  delay  nodes  and  all  the  edges  between 
them  except  the  self-loops  and  find  its  minimum  node 
cover  [12].  Partition  the  set  S0  [16,  section  II]  into  Sv  and 
50.  where  Sa  consists  of  the  nodes  that  are  not  part  of  the 
node  cover  and  S0-  consists  of  those  that  are.  Break  the 
output  edges  of  the  nodes  in  S0.  and  introduce  an  input 
and  output  node  in  each  edge  as  shown  in  Fig.  2.  The 
signals  measured  at  these  output  nodes  will  be  described 
by  the  vector  y,  and  those  introduced  at  the  input  nodes 
by  the  vector  u,.  Under  normal  operation  the  correspond¬ 
ing  nodes  are  connected,  i.e.,  u,  — y,.  When  fault  diagnosis 
needs  to  be  done  the  connection  is  broken. 

The  vector  u  which  represents  the  inputs  of  the  LSDS 
other  than  those  introduced  by  tearing  is  partitioned  as 


where  u"  consists  of  the  inputs  that  reach  only  the  compo¬ 
nents  in  So- 

Based  on  this  partitioning  of  u,  the  partitioning  of  S0 
and  the  introduction  of  u,  and  y,,  the  LSDS  description 
[16,  eqs.  (7)  and  (10)]  changes  to  that  in  (4)  where  the 
rationale  behind  the  partitioning  of  some  of  the  vectors 
and  matrices  is  made  self-evident  by  the  corresponding 
subscripts. 
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°“  Note  that  the  L  matrices  associated  with  b0-  are  now 
a\  associated  with  u,  and  that y,  ■  bQ..  Also.  L##  is  a  diagonal 
matrix.  We  now  decompose  the  LSDS  (4)  into  two  subsys¬ 
tems  which  are  defined  as  follows. 

Subsystem  l  (SSI)  is  defined  by  the  component-connec- 
K  J  tion  model: 
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■*0'  ^or 


The  connections  between  SSI,  SS2,  and  the  inputs  and 
outputs  of  the  LSDS  is  given  by 
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Subsystem  2  (SS2): 

Xfy  ”  Aq-Xq*  +  Ba-  P(yQ(y 
b<y  m  CVXV  +  P QfaV 


It  is  easy  to  verify  that  (5)— (7)  are  equivalent  to  the  LSDS 
description  in  (4).  As  a  result  of  the  decomposition  com¬ 
ponents  in  the  set  S0-  become  a  subsystem  separate  from 
the  rest.  Note  that  SSI  is  a  canonical  LSDS.  The  vectors 
dydi,- •  •  ,dK  are  considered  outputs  of  SSI  for  the  con¬ 
sistency  of  the  model  (5)-(7)  with  (4),  and  it  is  only 
outputs  d'  that  are  connected  to  output  terminals  of  the 
LSDS.  The  parameters  of  SSI  are  therefore  sequentially 
linearly  diagnosable  in  a  generic  sense,  by  [14,  theorem  1], 
if 

[IVb;L0.0-]  ©V 


[  (L(m;  Lj©.)  +  LiaP viLo,,,.',  Lq,0-)  ] 

©[  Lyi  +  Lyo./’o'Lgr,  ],  t  —  1 ,  •  •  • ,  * 

are  full  column  rank  in  a  generic  sense.  In  other  words, 
under  this  condition  the  parameters  of  SSI  can  be  sequen¬ 
tially  linearly  diagnosed  in  the  generic  sense  from  the 
LSDS  input-output  measurements,  without  the  informa¬ 
tion  of  SS2.  It  remains  to  show  that  the  parameters  of  SS2 
can  be  linearly  diagnosed  from  the  LSDS  input-output 
measurements  if  the  parameters  of  SSI  are  known.  This  is 
done  in  two-steps,  both  with  an  application  of  a  result  in 
[15]. 

Next,  we  want  to  show  that  SS2  is  diagnosable  from  the 
component  input-output  measurements,  i.e.,  the  parame¬ 
ter  Pq-  can  be  determined  from  ( a0-,b0 •).  Note  that  (6), 
with  the  subscript  0"  omitted  for  simplicity,  is  equivalent 
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x**Ax  +  Ba 

b-PCx  +  Pa.  (8) 

This  is  because  of  the  commutative  property  of  single-in¬ 
put  single-output  components  [16,  eq.  (3)]  between  p,  and 
&(s).  Equation  (8)  can  be  decomposed  into  a  component- 
connection  model: 

x—Ax  +  Be 

d-Cx  +  c  (9) 

s-Pr  (10) 

cl  [0  0  l}\d' 

r  *  /  0  0  s  •  (11) 

b  [0  I  0j[fl 

The  parameters  of  component  (9)  are  known,  while  the 
parameters  of  component  (10)  are  to  be  determined. 
Equation  (11)  is  the  connection  equation.  Formulating  the 
matrix  (12)  of  [15], 


,[-/  0 
o  / 


which  has  full  column  rank.  According  to  Corollary  1  of 
[15],  (r,r)  can  be  determined  from  (a,  b),  without  the 
information  of  P.  Since  P  is  a  diagonal  matrix,  it  can  be 
determined  from  (s,r)  by  (10).  Therefore,  P  can  be  de¬ 
termined  from  (a,  b),  the  input-output  measurement  of 
SS2.  In  fact,  P  can  be  determined  by  solving  a  set  of 
linear  equations  because  (10)  is  linear  with  respect  to  p,, 
and  (s,r)  can  be  determined  from  (a,  A),  independently 
from  P. 

Finally,  from  the  component-connection  model  (5)-(7), 
we  want  to  show  that  the  input-output  measurements 
(<26-,Ao-)  of  SS2  can  be  determined  from  (u,y)  with  the 
assumption  that  the  parameters  of  SSI  are  known  but  the 
parameters  of  SS2  are  not.  Again,  formulating  the  matrix 
(12)  of  [15], 

-/  0  O' 

Qm  0-/0  (|2) 

Q  Qx  Qi  o  (U) 

.  0  0  1. 

where 

Q\ m  Lyu' +  tyv^o r^vw 

Qim  ^y«r  + (13) 

which  has  full  column  rank.  From  Corollary  1  of  [15], 
(oo-.V)  can  b*  determined  from  (u,y)  without  knowing 
the  parameters  of  SS2. 

Definition  2 

For  a  canonical  LSDS  [16,  definition  7],  the  matrices 

Lio  Lyo  (14) 


[Lh,  +  Zw/»0LOB]rQ[ZVl  +  £voi»0L(V], 

(15) 

are  called  diagnosis  matrices. 


The  discussion  above  can  be  summarized  in  the  next 
theorem. 

Theorem  1 

If  the  diagnosis  matrices  of  SSI  are  full  column  rank  in 
a  generic  sense,  then  the  LSDS  (4)  is  sequentially  linearly 
diagnosable  in  a  generic  sense. 

Note  that  the  diagnosis  matrices  of  SSI  concerns  only 
with  Pv  which  are  those  memoryless  components  which 
are  not  part  of  the  node  cover. 

Before  we  discuss  the  design  of  the  diagnosis  matrices 
in  Section  V,  in  the  next  section  we  will  discuss  their 
graphical  structure  since  it  gives  an  insight  into  their 
design. 

IV.  The  Structure  of  the  Diagnosis  Matrices 

Consider  the  nty.q  matrix  [Liu  +  L^PqL^]  and  the  m  x 
n,  matrix  [^-t-  /y0/>0L0l].  The  rows  of  the  matrix  [2.^  + 
4<AAJr([4i +  lyoPoLub  correspond  to  input  (output) 
nodes  and  the  columns  correspond  to  / -minimum  delay 
system  nodes.  The  matrix  [Liu  +  LmPQL0u]T(Lyl  + 
Ly0PoLOi ])  has  a  nonzero  entry  if  the  input  (output)  node 
and  the  system  node  that  correspond  to  that  position  in 
the  matrix  have  an  edge  or  a  directed  path  through  a 
0-minimum  delay  node  between  them.  The  value  of  the 
entry  is  the  total  gain  along  the  above  mentioned  directed 
paths  between  the  two  nodes  computed  by  multiplying  the 
gains  of  edges  in  series  and  adding  the  gains  of  edges  in 
parallel.  Thus  due  to  these  two  matrices  we  have  certain 
directed  paths  from  input  to  output  nodes  through  the 
/-minimum  delay  system  nodes. 

The  matrix  [/.,„  +  2M)/,02^J7O[2>I  +  L^P^L^]  is  a  </m  x 
rtj  matrix.  Each  of  its  columns  corresponds  to  an  /-mini¬ 
mum  delay  system  node  while  each  of  its  rows  corre¬ 
sponds  to  an  ordered  pair  of  input  and  output  nodes. 
There  is  a  nonzero  entry  in  any  position  of  this  matrix  if 
there  exists  a  directed  path  between  the  input  and  output 
nodes  of  the  ordered  pair  corresponding  to  the  row 
through  the  /-minimum  delay  node  corresponding  to  the 
column  and  possibly  some  0-minimum  delay  nodes.  Such 
a  directed  path  is  called  an  i/o  path.  An  i/o  path  is 
identified  by  the  ordered  pair  of  its  end  nodes.  Note  that 
not  all  directed  paths  from  an  input  node  to  an  output 
node  through  an  /-minimum  deiay  node  is  in  i/o  path 
through  that  /-minimum  delay  node.  Only  those  directed 
paths  that  contribute  a  nonzero  entry  to  the  diagnosis 
matrix  are  considered  a s  i/o  paths.  Fig.  3  considers  vari¬ 
ous  directed  paths  between  an  input  and  an  output  node 
as  candidates  for  i/o  paths  through  the  i -minimum  deiay 
node  A .  All  of  them  except  for  Figs.  3(e)  and  3(0  are  valid 
i/o  paths  through  the  node  A.  In  the  case  of  the  0-mini- 
mum  delay  components  the  diagnosis  matrix  reduces  to 
L^-LyQ  and  an  i/o  path  through  a  0-minimum  delay  node 
goes  through  no  other  system  node.  Note  that  in  Fig.  3(d) 
we  have  the  same  / /  o  path  through  the  0-minimum  delay 
node  B4  and  the  I -minimum  delay  node  A. 

If  we  construct  a  bipartite  graph  [13]  with  X  and  Y.  the 
two  disjoint  subsets  of  nodes,  corresponding  to  the  rows 
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Fig.  3.  Candidates  for  i/ o  paths. 

and  columns  of  the  /-minimum  delay  diagnosis  matrix, 
and  draw  an  edge  between  two  nodes  if  the  corresponding 
entry  in  the  matrix  is  nonzero,  then  a  complete  matching 
of  Y  into  AT  [13],  is  a  necessary  and  sufficient  condition 
for  the  diagnosis  matrix  to  be  full  column  rank  in  a 
generic  sense  in  the  space  of  nonzero  entries  of  the 
matrices  L,y,  L^,  L^,  ly„  Lyy,  L&,  and  P0.  However,  since 
we  require  that  the  diagnosis  matrices  be  full  column  rank 
in  a  generic  sense  in  the  space  R"°  complete  matching  is 
only  a  sufficient  condition  for  sequentially  linear  diagno¬ 
sis  in  a  generic  sense. 

V.  Design  of  the  Diagnosis  Matrices 

In  this  section  we  develop  an  algorithm  for  the  design 
of  diagnosis  matrices  that  have  full  column  rank  in  a 
generic  sense.  Therefore,  we  assume  that  all  the  parame¬ 
ters  of  the  canonical  LSDS  need  to  be  diagnosed.  The 
modification  of  the  algorithm  to  the  case  where  only  some 
of  the  parameters  of  the  canonical  LSDS  need  to  be 
diagnosed  is  straightforward. 

If  the  canonical  LSDS  has  no  existing  input  and  output 
terminals  then  the  strategy  of  adding  a  minimum  number 
of  test  points  to  make  the  parameters  sequentially  linearly 
diagnosable  in  a  generic  sense  is  trivial.  We  first  choose 
the  number  of  input  terminals  q  and  the  number  of  output 
terminals  m  which  minimize  the  quantity  (q  +  m)  subject 
to  the  constraint 


qm>  max  n,.  (16) 


The  matrices  L*  and  Iyt,  /— 0, 1,- 
that 

—  !  or 


•  • ,  K  are  then  chosen  so 


(17) 


Then  by  [16,  association  4  and  theorem  1]  the  parameters 
of  the  canonical  LSDS  are  diagnosable  in  a  generic  sense. 


The  procedure  for  constructing  the  matrices  L,u  and  Ly, 
so  that  O  lyt  is  as  given  in  (17)  is  best  understood  by 
using  the  bipartite  graph  discussed  in  the  previous  section. 
We  start  with  the  two  disjointed  subsets  of  nodes  corre¬ 
sponding  to  the  qm  possible  i/o  paths  and  the  nt  /-mini¬ 
mum  delay  system  nodes.  Each  /-minimum  delay  system 
node  is  matched  to  a  different  i/o  path.  In  the  digraph  of 
the  canonical  LSDS  we  then  create  the  i/o  path  corre¬ 
sponding  to  the  system  node  by  introducing  a  directed 
edge  of  gain  1  to  the  system  node  from  the  corresponding 
input  node  and  from  the  system  node  to  the  correspond¬ 
ing  output  node.  This  procedure  is  carried  out  for  /- 
0,1, •••.A'. 

However,  since  all  systems  have  input  and  output  termi¬ 
nals  which  already  exist,  and  have  not  been  introduced 
for  the  purpose  of  diagnosis,  we  should  make  the  best 
possible  use  of  the  i/o  paths  created  by  these  inputs 
(inputs  created  in  the  canonical  LSDS  due  to  the  decom¬ 
position  of  the  LSDS  can  also  be  considered  as  existing 
inputs)  and  outputs.  Stated  from  a  matrix  point  of  view, 
some  of  the  rows  of  and  lyt,  j—0, 1,-  •  •  ,K,  are  already 
fixed  and  what  we  need  to  do  is  add  a  minimum  number 
of  extra  input  and/or  output  nodes  (under  normal  opera¬ 
tion  the  input  signals  will  equal  zero  and  the  output 
signals  need  not  be  measured)  and  appropriate  rows  in  the 
above  mentioned  matrices  to  make  the  parameters  of  the 
canonical  LSDS  sequentially  linearly  diagnosable  in  a 
generic  sense.  Though  this  is  a  conceptually  simple  prob¬ 
lem  it  is  not  an  easy  task  to  create  an  efficient  algorithm 
which  will  solve  this  problem  for  any  given  canonical 
LSDS. 

The  algorithm  that  we  present  makes  the  parameters  of 
the  canonical  LSDS  sequentially  linearly  diagnosable  in  a 
generic  sense  but  it  has  not  been  proved  that  it  will  use  the 
minimum  possible  number  of  test  points  required.  Starting 
with  the  first  matrix  (Ly^OLy)  the  algorithm  sequentially 
makes  each  one  of  them  of  full  column  rank  in  a  generic 
sense.  In  the  algorithm  we  start  with  the  minimum  num¬ 
ber  of  test  points  required,  i.e.,  the  minimum  number 
required  to  satisfy  the  inequality  (16).  Then,  when  we  are 
working  with  the  /-minimum  delay  diagnosis  matrix  we 
first  create  the  possible  i/o  paths  through  /-minimum 
delay  system  nodes,  which  go  between  the  existing  i/o 
nodes  and  the  test  points  and  the  test  points  themselves 
and  help  increase  the  rank  of  the  diagnosis  matrix.  This 
operation  is  done  in  an  efficient  manner  by  considering 
the  system  nodes  in  the  increasing  order  of  the  number  of 
input  or  output  nodes  to  which  they  are  connected.  Note 
that  we  are  allowed  to  create  edges  only  between  test 
points  and  system  nodes  and  not  between  existing  i/o 
nodes  and  system  nodes  since  such  an  edge  would  change 
the  normal  operation  of  the  system.  At  this  stage  extra  test 
points  are  added  only  if  they  are  needed  to  make  the 
diagnosis  matrix  of  full  column  rank.  As  a  result  of  the 
algorithm  each  diagnosis  matrix  has  the  structure  shown 
in  Fig.  4.  Qearly  the  matrix  in  Fig.  4  is  of  full  column 
rank.  Notation  for  the  algorithm  is  as  follows. 
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A^  A^  A^  *—  i/o  p*tha  b«tw««n  axiatin^  i/o  r.o4«s 

0  A2  0  i/o  patha  potwoon  tast  pointa  and  axiatin* 

i/o  nodaa 

0  0  1  4—  i/o  paths  batwaon  taat  points 

a5  aS  *7 

i  full  coluarv  rank 

A2  i  column  acftalon  form  (full  column  ranx) 

Fix  4.  Structure  of  •  diagnosis  matrix. 

The  input  and  output  nodes  that  are  part  of  the  canoni¬ 
cal  LSDS  before  any  test  points  are  added  are  called 
existing  nodes  while  the  test  points  (which  are  added  on 
by  the  algorithm)  are  called  test  nodes: 

a  number  of  existing  input  nodes, 
b  number  of  existing  output  nodes, 
u  signal  at  the  a  existing  input  nodes, 
y  signal  at  the  b  existing  output  nodes, 
n,  number  of  i-min  delay  system  nodes, 

F0  nominal  value  of  0-minimum  delay  parameters, 

K  maximum  delay  order  of  all  the  system  nodes  in 
the  canonical  LSDS. 

Algorithm 

1)  Input  g,  the  subgraph  of  the  canonical  LSDS  which 
consists  of  all  the  nodes,  and  the  edges  between  the 
/•minimum  delay  nodes,  /  -0, 1  •  •  •  K,  and  the  input,  out¬ 
put,  and  0-minimum  delay  nodes. 

2)  Add  to  g,  r  input  test  nodes  and  s  output  test  nodes 
where  r  and  s  are  chosen  such  that 

r  +  s  is  a  minimum 

subject  to 

(a  +  r)(  b  +  s)  >  max 

/-0, 

3)  Let  iV-0. 

4)  If  im 0,  L~  L^QLyQ,  and  else 

“  [  Adi  +  LfffP oLjb]  Q  [  +  lyo^ 0  Av  ]  • 

5)  From  the  columns  of  L  select  a  basis  for  its  column 
space. 

6)  To  each  one  of  the  /-minimum  delay  nodes  that  do 
not  correspond  to  the  basis  selected  in  step  5  (these  are 
called  the  remaining  nodes)  assign  two  numbers  called  the 
{/-number  and  the  T-number,  where  the  {/-number  is  the 
number  of  existing  input  nodes  that  are  connected  to  this 
system  node  either  via  an  edge  or  a  directed  path  through 
a  0-minimum  delay  node,  and  the  T-number  is  the  num¬ 
ber  of  existing  output  nodes  to  which  this  node  is  con¬ 
nected  either  via  an  edge  or  a  directed  path  through  a 
0-minimum  delay  node. 

7) ,  16)  Form  a  stack  Rx{Ry )  of  the  remaining  nodes 
(nodes  in  Sx)  such  that  nodes  with  a  smaller  {/-number 
(  T-number)  are  closer  to  the  top  of  the  stack.  Arbitrarily 
order  nodes  with  the  same  {/-number  ( T-number). 

8) ,  17)  Arbitrary  assign  an  ordering  1  through  s(r)  to 
the  test  output  (input)  nodes.  Let  the  set  S,(S),)*-0. 


9) ,  18)  If  Rx(Ry)  is  empty  go  to  step  16(25). 

10) ,  19)  Pop  the  stack  Rx(Ry).  Let  j*~  1. 

11) ,  20)  Create  a  directed  edge  of  gain  1  in  g  from  this 
popped  node  (test  input  node  j)  to  test  output  node  j  (this 
popped  node). 

12) , 21)  If  a  new  i/o  path  through  /-minimum  delay 
nodes  has  been  created  go  to  step  9(18). 

13) ,  22)  Delete  the  directed  edge  created  in  step  11(20). 

14) ,23)  Let  j*-j  +  1.  If  j<s(r)  go  to  step  1 1(20). 

15) ,24)  Assign  the  system  node  to  the  set  St(Sy)  and  go 
to  step  9(18). 

25)  Let,  n,«-the  number  of  system  nodes  in  the  set  Sy 
and,  ri,«-the  number  of  i/o  paths  in  g  through  /-minimum 
delay  nodes  which  are  between  test  input  nodes  and  test 
output  nodes. 

26)  Add  to  g  f  test  input  nodes  and  s  test  output  nodes 
where  r  and  s  are  chosen  such  that 

r+s  is  a  minimum 

subject  to 

(r  +  F)(s  +  s)>hi  +  it,-. 

27)  Let  r«-r+  r  and  s«— s  +  s. 

28)  Create  h,  new  i/o  paths  through  the  /-minimum 
delay  system  nodes  by  adding  to  g  appropriately  directed 
edges  of  gain  I  between  the  system  nodes  in  Sy  and  the 
test  input  and  output  nodes  so  that  one  and  only  one  of 
these  i/o  paths  goes  through  each  system  node  in  Sy. 

29)  Let  i*-i  +1.  If  /  <  K  go  to  step  4. 

30)  Output  g. 

31)  Stop. 

Note  that  in  steps  7-15,  we  try  to  create  new  i/o  paths 
by  creating  edges  from  a  system  node  to  a  test  output 
node;  and  in  steps  16-24  we  try  to  create  more  / /  o  paths 
by  creating  edges  from  test  input  nodes  to  system  nodes. 

The  algorithm  that  we  have  presented  has  some  short¬ 
comings.  In  order  to  minimize  the  total  number  of  test 
points  we  have  introduced  both  input  and  output  test 
points.  However,  an  input  test  point  requires  the  introduc¬ 
tion  of  a  signal  generator  and  could  therefore  be  called  an 
active  test  point  while  an  output  test  point  requires  only  a 
measurement  and  therefore  is  a  passive  test  point.  In  a 
practical  situation  one  might  wish  to  use  more  passive 
than  active  test  points.  Such  a  requirement  can  be  incor¬ 
porated  into  the  algorithm  by  defining  a  cost  function 
which  assigns  a  greater  cost  to  an  active  test  point  than  to 
a  passive  test  point  and  then  minimizing  this  cost  function 
rather  than  the  total  number  of  test  points.  Another 
drawback  is  that  we  have  assumed  that  an  input  can  be 
connected  to  any  component  and  that  the  outputs  of  any 
set  of  components  can  be  added  to  create  a  system  output. 
In  other  words,  we  have  assumed  that  there  are  no  con¬ 
straints  on  the  connections.  This  may  not  always  be  true. 
One  type  of  constraint  that  is  imposed  on  the  connections 
is  that  the  inputs  or  outputs  of  two  components  may  not 
be  the  same  physical  quantity.  For  example  if  the  outputs 
of  two  components  are  voltage  and  current  they  cannot  be 
added  to  create  a  system  output.  Another  kind  of  con- 
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straint  imposed  on  the  connections  is  that  they  may  have 
to  satisfy  certain  physical  laws.  In  electrical  circuits  for 
example  the  connections  have  to  satisfy  KVL  and  KCL. 
Physical  systems  will  in  some  cases  have  certain  con¬ 
straints  on  the  possible  connections  and,  therefore,  ap¬ 
propriate  constraints  have  to  be  introduced  into  the  algo¬ 
rithm  for  the  design  of  the  diagnosis  matrices. 

In  spite  of  these  weaknesses  the  algorithm  is  useful 
since  it  provides  the  basis  for  a  systematic  design  of 
diagnosabie  systems.  In  contrast,  existing  tests  of  diagnos- 
ability  can  be  applied  only  in  a  limited  way  to  the  design 
of  diagnosabie  systems.  To  amplify  the  point,  these  tech¬ 
niques  work  as  follows.  The  designer  makes  an  “intelli¬ 
gent”  choice  of  test  points  and  checks  the  condition  for 
diagnosability.  If  the  condition  is  not  satisfied,  he  chooses 
another  set  of  test  points  and  continues  the  process  until 
the  condition  is  satisfied.  Such  a  technique  might  be 
effective  for  a  small  system  but  is  inadequate  for  a  system 
that  a  truly  large  scale. 

VI.  Summary  and  Conclusions 

It  is  long  been  recognized  that  the  problem  of  fault 
diagnosis  of  LSDS  is  one  of  exploiting  the  structure  of  the 
LSDS  in  a  suitable  manner.  Saeks  et  al.  [5]  introduced  the 
component-connection  model  of  LSDS  to  the  fault  diag¬ 
nosis  problem  since  it  gave  a  better  insight  into  the 
structure  of  LSDS.  In  this  two-part  paper  we  have  con¬ 
tinued  in  the  same  spirit.  By  writing  the  fault  diagnosis 
equations  in  terms  of  the  Markov  parameters  of  the  LSDS 
rather  than  the  transfer  function  evaluated  at  different 
frequencies  as  in  [3]-[5],  we  are  able  to  use  to  our  advan¬ 
tage  the  property  of  the  minimum  delay  order  of  a  com¬ 
ponent  to  write,  in  Part  I,  the  sequentially  linear  fault 
diagnosis  equations.  The  condition  for  diagnosability  as  a 
rank  test  on  a  set  of  matrices  follows  from  these  equa¬ 
tions. 

In  Part  II  we  explore  the  “automatic”  design  of  di¬ 
agnosabie  systems.  We  first  present  a  strategy  of  test  point 
location,  which  reduces  the  problem  of  sequentially  linear 
diagnosis  of  an  LSDS  to  that  of  a  canonical  LSDS.  We 
then  discuss  an  algorithm  to  determine  the  test  point 
locations  required  to  make  the  parameters  of  the  canoni¬ 
cal  LSDS  sequentially  linearly  diagnosabie  in  a  generic 
sense.  The  algorithm  is  based  on  the  graphical  structure  of 
the  diagnosis  matrices  which  is  also  discussed. 
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ABSTRACT 

A  theory  for  the  study  of  analog  circuit  single-fault-diagnosis  prob¬ 
lem  is  developed  in  this  paper.  First,  the  concepts  of  fuzzy,  precise, 
and  source  components  along  vith  canonical  circuits  are  introduced.  A 
sufficient  condition  and  an  algorithm  are  then  given  to  determine  the  di- 
agnosibility  of  a  canonical  circuit  and  the  identif iability  of  the  faulty 
fuzzy-component  in  the  circuit.  Finally,  an  example  is  included  to  il¬ 
lustrate  the  application  of  the  theory. 

I.  INTRODUCTION 

Frequently,  a  faulty  circuit  results  from  very  few  faulty  elements  in 
the  circuit.  This  observation  has  been  used  in  several  papers  concerning 
fault-diagnosis  problem  of  analog  circuit.  Most  of  them  [1,2,3]  employ 
the  "simulation  approach".  In  this  approach,  the  various  element  values 
are  changed  so  as  to  simulate  element  failures  and  all  the  test-point 
voltages  are  computed.  These  results  are  used  either  to  prepare  a  "fault 
dictionary"  which  would  be  supplied  to  diagnosis  technicians,  or  to  com¬ 
pare  directly  with  the  measured  data  to  determine  the  faulty  elements  un¬ 
der  some  criterion.  The  computation  time  and  memory  size  are  greatly  re¬ 
duced  by  the  use  of  very-few-fault  observation. 

In  this  paper,  the  single-fault  case  is  studied.  By  taking  advantage 
of  this  information,  a  sufficient  condition  based  on  graph  theory  is  ob¬ 
tained.  The  process  involved  a  decomposition  of  a  circuit  into  sub¬ 
circuits  to  satisfy  certain  properties. 

A  novelty  of  the  paper  is  the  way  it  is  treated  for  those  elements 
whose  nominal  characteristics  are  fuzzily  specified,  or  specified  with  a 
large  tolerance. 


II.  PRELIMINARY  DEFINITIONS  AND  ASSUMPTIONS 

A  circuit  component  may  be  a  circuit  element,  a  subcircuit  or  a 
functional  block.  Each  component  is  characterized  by  its  characteristic , 


such  as  the  resistance  of  a  resistor,  the  gain  of  an  operational  amplifier, 


or  Che  characteristic  curves  of  a  transistor. 


Definitions. 

1.  A  nominal  characteristic  is  the  designed  characteristic  of  the 
component. 

2.  An  actual  characteristic  is  the  characteristic  of  the  component 
at  the  time  of  testing. 

3.  A  precise  component,  or  p-comp ,  is  one  whose  nominal  character¬ 
istic  is  precisely  specified.  It  is  fault-free  if  its  actual  character¬ 
istic  is  the  same  as  its  nominal  one. 

4.  A  fuzzy  component,  or  f-comp t  is  one  whose  nominal  characteristic 
is  not  precisely  specified,  but  within  a  specified  range.  It  is  fault- 
free  if  its  actual  characteristic  is  within  this  range. 

5.  A  source  component,  or  s-comp .  is  one  whose  terminal-voltages 
and  terminal-currents  can  be  measured  at  the  time  of  testing. 

Examples  of  fuzzy  components  are  transistors,  operational  amplifiers 
and  electrolytic  capacitors. 

Remarks. 

1.  For  fault  diagnosis  problems,  the  nominal  characteristics  are 
usually  given  and  the  actual  characteristics  are  unknown. 

2.  In  a  realistic  case,  the  actual  cnaracteristic  of  a  precise 
component  may  deviate  from  its  nominal  one  under  fault-free  case,  but  the 
deviation  is  small,  say  less  than  five  percent. 

Assumptions. 

1.  The  actual  characteristic  of  a  component  can  be  uniquely  de¬ 
termined  from  it3  terminal-voltages  and  terminal-currents. 

2.  The  nominal  characteristic  of  a  p-comp  is  voltage-controlled,  Le., 
its  terminal- currents  is  uniquely  determined  by  its  terminal-voltages. 

Definitions. 

6.  A  circuit  is  diagnosable  w.r.t.  a  set  of  measurements  if  all 
faulty  components  can  be  located  from  the  measurements. 

7.  A  component  of  a  circuit  is  identifiable  w.r.t.  a  set  of  measure¬ 
ments  if  its  actual  characteristic  can  be  determined  from  the  measurements. 

III.  THE  MAIN  THEOREM 

In  this  section,  a  restricted  class  of  circuits  will  be  considered. 

It  is  a  class  of  circuits  crucial  to  one-component-fault  problems.  The 
extention  to  the  general  circuit  will  be  considered  in  the  later  section. 


Definition  6.  A  circuit  is  canonical  if  it  satisfies  the  following: 

(Cl)  All  components  are  two-terminal. 

(C2)  It  contains  only  p-comp  and  f-comp. 

(C3)  Its  graph  is  non-separable. 

(C4)  There  is  neither  f-comp  only  loops,  nor  p-comp  only  cutsets. 

(C5)  There  is  at  most  one  component  connecting  any  two  nodes. 

The  canonical  circuit  may  be  excited  by  any  number  of  s-comps,  con¬ 
necting  across  any  two  nodes,  with  any  waveforms,  as  long  as  Assumption  1 
is  satisfied.  With  the  above  condition,  the  following  theorem  can  be  stated. 

Theorem  1.  A  canonical  circuit  is  diagnosable  and  its  faulty  f-comp 
is  identifiable,  if 

(Al)  at  most  one  component  is  at  fault. 

(A2)  all  node  voltages  are  measurable. 

Proof :  The  proof  of"  Theorem  1  is  based  on  the  following  algorithm. 

Step  1.  Calculate  the  terminal  currents  for  all  p-comps,  from  their 
nominal  characteristics  and  measured  node  voltages.  These  currents  are 
called  the  estimated  currents. 

Step  2.  Calculate  the  terminal  currents  for  all  f-comps,  from  the 
estimated  currents  obtained  in  Step  1.  This  is  done  by  the  fundamental 
cutset  equations  [4,5]  associated  with  the  tree  containing  all  and  only 
f-comps.  The  existence  of  such  a  tree  is  guarenteed  by  Condition  (C4) . 
These  obtained  currents  again  are  called  the  estimated  currents. 

Step  3.  Determine  the  characteristic  of  f-comps  from  their  estimated 
currents  and  measured  node  voltages,  called  the  estimated  characteristic. 

Step  4.  Label  each  f-comp  with  the  letter  "T"  or  "F",  according  to 
the  following  rule.  If  the  estimated  characteristic  is  within  its  spec¬ 
ified  range,  label  it  "T",  otherwise,  "F".  There  may  be  zero,  one,  or 
more  than  one  f-comps  labeled  F.  In  the  last  case  all  f-comps  labeled  F 
have  to  be  in  the  same  fundamental  loop. 

Step  5.  Locate  the  fault  component  according  to  the  following  rule. 

1.  If  there  is  no  F-labeled  f-comp,  then  there  is  no  faulty  com¬ 
ponent  in  the  circuit. 

2.  If  there  is  one  F-labeled  f-comp,  then  this  f-comp  is  at  fault. 

3.  If  there  is  more  than  one  F-labeled  f-comps,  then  the  p-comp 
which  defined  the  fundamental  loop  containing  these  f-comps  is  at  fault. 

It  remains  to  be  shown  that  the  decision  made  in  Step  5  is  a  correct 
one.  Since  there  are  at  most  one  faulty  components,  there  are  only  three 


possibilities. 

1)  There  is  no  faulty  components.  Therefore,  the  estimated  currents 
of  all  p-comps  are  actual  currents.  As  a  consequence,  all  estimated  cur¬ 
rents  and  estimated  characteristics  of  f-comps  are  actual  ones.  Therefore, 
there  will  be  no  F-labeled  f-components. 

2)  There  is  one  f-comp  at  fault.  In  this  case,  all  p-comps  are 
fault-free.  By  the  same  reason  as  above,  all  estimated  currents  and 
estimated -characteristics  of  f-comps  are  actual  ones.  Therefore,  there 
will  be  one  F-labeled  f-comp. 

3)  There  is  one  p-comp  at-  fault.  In  this  case  the  estimated  current 
of  this,  and  only  this,  p-comp  is  at  error.  As  a  consequence,  all  the 
estimated  currents  are  at  error  for  those,  and  only  those,  f-comps  in  the 
fundamental  loop  defined  by  this  p-comp.  Therefore,  their  estimated 
characteristics  will  also  be  at  error,  causing  a  switch  in  their  labelling 
from  T  to  F.  Since,  in  view  of  Condition  (C5),  there  are  at  least  two 
f-comps  in  every  fundamental  loop,  there  will  be  two  or  more  f-comps 
labeled  by  F.  This  completes  the  proof  that  the  circuit  is  diagnosable. 

If  one  f-comp  is  at  fault,  from  the  above  discussion,  all  estimated 
characteristics  of  f-comps  are  the  actual  ones.  In  particular,  the 
actual  characteristic  of  the  faulty  f-comp  can  be  obtained.  Therefore, 
it  is  identifiable.  The  theorem  is  proved. 

The  definition  of  canonical  circuit  and  the  conditions  of  Theorem  1 
seem  to  be  very  restrictive.  Actually  this  is  not  the  case.  The  only 
crucial  condition  is  that  there  exists  no  f-comps  only  loops  and  that 
there  is  at  most  one  component  at  fault.  The  others  can  be  alleviated  by 
a  proper  grouping  of  circuit  elements  into  components.  This  is  illus¬ 
trated  by  the  next  example. 


IV.  AN  EXAMPLE 

An  example  will  be  given  which  apparently  does  not  satisfy  Conditions 
(Cl),  (C4)  and  (A2). 

Example  1.  Consider  a  voltage  regulator  as  shown  in  Figure  1.  The 
measurable  node  voltages  are  at  nodes  (1),  (2),  (3)  and  (4),  an:,  he  s- 
comps  are  at  the  input  and  output  terminals.  From  these  measurements,  we 
want  to  locate  the  one-element- fault. 

Note  first  that  node  voltages  at  nodes  (A)  and  (3)  are  not  measurable. 
This  can  be  alleviated  by  grouping  elements  (Q1,Q2,D2,R5)  into  a  4-termi¬ 
nal  f-comp  and  (D3,D4)  into  a  two-terminal  f-comp.  The  other  f-coraps  are 


D1  and  C.  The  rest  of  them  are  p-comps.  Next,  the  above  4-terminal 
f-comp  is  further  represented  by  three  coupled  two-terminal  f-comps.  The 
resulting  circuit  is  shown  in  Figure  2.  Note  that  all  node  voltages  are 
now  measurable. 

In  order  to  check  if  the  circuit  is  canonical  or  not,  the  s-comps 
are  first  removed.  It  is  seen  that  there  is  a  cutset  of  PI  and  P3.  This 
situation  can  be  alleviated  in  the  following  way.  Calculate  the  estimated 
currents  for  PI  and  P3,  and  see  if  the  ..cutset  equation  of  PI,  P3  and  SI 
is  satisfied  or  not.  If  it  is  not  satisfied,  then  the  fault  is  clearly 
at  either  PI  or  P3.  No  further  calculations  need  to  be  done.  If  the  cut 
set  equation  is  satisfied,  then  both  PI  and  P3  are  fault-free.  In  addition, 
the  estimated  currents  are  the  actual  currents.  Therefore,  PI  and  P3  be¬ 
come  s-comps.  The  remaining  circuit  no  longer  has  any  P-comp,  only 
cutsets. 

Since  P3  is  now  considered  as  a  s-comp,  when  removed  the  component 
F2  is  separable  from  the  circuit.  A  close  check,  the  actual  current  of  F2 
can  also  be  found.  Therefore,  it  can  also  be  considered  as  a  s-comp.  The 
resulting  circuit  is  shown  in  Figure  3,  which  is  canonical  except  that 
components  F2,  F3  and  F4  are  coupled. 

By  the  same  reasoning  a  partial  answer  can  be  obtained  and  it  is  sum¬ 
marized  in  Table  1.  Note  that  only  the  partial  labels  are  used  for  the 
decision  and  the  coupling  relation  between  (F2,  F3,  F4)  is  not  used. 


Observed 

Labels 

Elements  at  Fault 

FI 

F5 

F6 

T 

T 

T 

(F2,  F3,  or  F4),  or  no  fault 

T 

T 

F 

F6  or  P4 

T 

F 

T 

F5 

T 

F 

F 

P5 

F 

T 

T 

FI 

F 

T 

F 

P2 

F 

F 

T 

F5 

F 

F 

F 

More  than  one  fault 

Table  1.  Diagnosis  Diagram  for  Example  1. 


Figure  1 


o 

P3 


FI 


-* - Or 

(2)  ^F2 


i  \ 

/  v 

t  \ 


Figure  2 


Figure  3 
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ABSTRACT 

The  conventional  methods  used  for  the  deter¬ 
mination  of  the  rank  of  a  noisy  matrix  are  based 
on  singular  values.  It  is  shown  that  the  singu¬ 
lar-value  approach  will  create  an  inconsistancy 
especially  when  the  matrix  is  large.  The  residual- 
number  approach  is  presented  for  this  case. 

I.  INTRODUCTION 

Let  the  matrices  A  and  A  be  related  by 
A  *  A  ♦  E 

where  the  matrix  E  is  the  noise  (error)  matrix 
whose  elements  are  assumed  to  be  small.  A  rank 
degeneracy  problem  (I]  is  the.determination  of  the 
rank  of  A  from  the  matrix  A  .  He  are  especial¬ 
ly  interested  in  the  case  when  the  rank  of  A  is 
full  rank  but  the  rank  of  A  is  not.  Under  this 
case,  a  square  matrix  A  is  usually  said  to  be  a 
ill-conditioned  or  near  singular  [2],  [3]. 

One  may  conjecture  that  a  nearly  singular 
matrix  must  have  a  near-zero  determinant,  or  a 
near-zero  eigenvalues.  This  conjecture  turns  out 
tp  be  false.  There  exists  a  matrix  which  is  near 
singular  but  whose  determinant  and  eigenvalues 
are  equal  to  unity  [4].  The  current  and  conven¬ 
tional  methods  for  the  determination  of  the  rank  ' 
and  the  ill-conditioness  of  a  matrix  are  based  on 
its  singular  values  [3],  [5J. 

Let  A  *  Rnlxn  ,  with  m  >  n  .  There  exist 
orthogonal  matrices  U  and  V~  such  that 

[Si 

where  £  «  diag.  (s^,  s2,  ....  sn)  and  s1  »  s2 

>  ...  iSn  >  0  .  The  numbers  s j,  53,  ....  sn 

are  unique  and  are  called  the  singular  values. 

The  rank  of  A  is  r  if  and  only  if  sr  t  0 

and  s^j  »  0.  When  a  matrix  is  corrupted  by  a 

noise,  a  characterization  of  the  rank  of  a  matrix 
is  given  by  Golub  et  al  (11.  A  matrix  A  is 
said  to  have  numerical  rank  («,c,r)  with  re¬ 
spect  to  the  norm  1 1  •  1 1  if 

r  «  inf(rank  aj  |  |A  -  B|  j  <_  *} 

•This  research  is  supported  in  part  by  the 
ONR  Grant  N00014-78-C-Q444. 


e  <  4  <_Sup  f n J  j  |A  -  8|  |  <_  n  ■*  rank  B  >_  r> 

The  numerical  rank  is  related  with  the  singular 
values  as  follows. 

Theorem  1(11.  A  matrix  A  has  numerical  rank 
(5,  c,  r)  with  respect  to  ||,||.,  if  and  only  if 


(5,  c,  r)  with  respect  to  H-il,,  if  and  only  if 
sr  —  4  *  £  l*r*V 

Consequently,  the  numerical  rank  of  a  matrix  is 
completely  determined  by  its  singular  values. 
However,  the  singular-value  approach  will  result 
as  an  inconsistancy.  If  the  rank  of  an  nxn 
matrix  is  (n  -  1),  then  by  deleting  an  appropri- 
ate  column  of  the  matrix,  the  rank  of  the  -esult- 
ing  matrix  should  remain  to  be  (n  -  1)  .  This  is 
not  true  as  shown  in  the  following  example  if  the 
singular-value  approach  is  used. 

Example  1  [ 1] .  Consider  the  matrix 


(n)  •  I.  -  SinisIinJ. 


where  In  is  the  nxn  identity  matrix  and 

eT(n)  *  (1 ,  1 ,  . . . ,  1 1 .  It  is  easy  to  show  that 
the  singular  values  of  this  matrix  are  (1,  I,  .... 
I,  0).  When  any  i  columns  are  deleted,  the 
remaining  nx(n  -  i)  matrix  has  singular  values 

(1,  1,  ....  1,  )  .  Clearly,  when  n  is  large, 

there  is  always  one  and  only  one  singular  value 
near-zero.  In  other  words,  for  any  (c,4)  such 
that  1  >.  4  >  e  >  0,  the  numerical  rank  of  the 
nx(n  -  i)  submatrix  is  always  one  less  than  its 
full  rank,  i.e.,  (n  -  i  -  1),  for  any  i  <  <  n. 
This  clearly  is  an  inconsistency  because  if  tne 
rank  of  an  nxn  matrix  is  r  <  n,  then  there 
exists  a  nxr  submatrix  whose  rank  is  full  rank  r. 

In  this  paper,  we  propose  to  use  the  residual 
numbers  to  determine  the  rank  of  a  matrix.  Certain 
nice  properties  will  be  presented. 

II.  RESIDUAL  NUMBERS 

We  consider  A  c  Rmxn  .  Let  the  rank  of  A 
denoted  by  o ( A)  and  the  column-space  of  A  by 
R(A).  Let  a,  denote  the  1-th  column-vector 


of  A,  and  the  remainder  of  A  with  a.  de¬ 
leted.  Without  loss  of  generality,  assume  that 
«  >  n  . 

Definition  1.  The  n  residual  numbers  rf 
of  A  is  defined  by 


r1  (A)  ia/(I  -  AiAt)ai 


for  i  *  1,  2,  ....  n,  where  A(  is  the  pseudo¬ 
inverse  of  A.  [61  and  I  the  identity  matrix. 

It  is  well  known  that 

r .  *  mint | |a^  -  x | | 2  x  c  R(A.l) 

Consequently,  a(  is  a  linear  combination  of  the 
column-vectors  of  Af  if  and  only  if  r.  *  0  . 
Therefore,  A  has  full  rank  if  and  only  if  r^  f  0 

for  all  i  .  These  observations  lead  to  an  alter¬ 
nate  definition  of  the  rank  of  a  matrix  A  .  It 
also  orovldes  a  residual -number  test  for  the  rank 
of  A,  when  not  corrupted  by  noise. 

Definition  2.  An  mxn  matrix  A  has  rank  r 

if. 

(i)  there  exists  an  mxr  submatrix  8 
such  that 

r< (B)  »  0  ,  i  -  l,  2 . r 

(11)  there  is  no  mx(r+l)  submatrix  8 
such  that 

ri  (B)  >0  ,  1  -  1,  2 . (r+l). 

An  important  relation  between  a  matrix  A 
and  its  residual  numbers  is  given  by  Theorem  2. 

Theorem  2.  When  (A^A)*1  exists, 

r<  ■  d,*1  (2) 

where  d{  Is  the  1-th  diagonal  element  of 
(ATA)*1;  otherwise, 

r.  •  11m  d,_1(e)  (3) 

c-0+ 

where  d ^ (c )  is  the  1-th  diagonal  element  of 
(ATA  +  c  l)'1  for  c  »  0. 

As  a  consequence,  a  relation  between  the 
residual  numbers  and  the  singular  values  of  A 
can  be  established,  let  s, ,  s2,  ....  s  be  the 
singular  values  of  A  . 

Corollary  2. 1.  If  A  has  full  column-rank 
l.e. ,  o(A)  ■  n,  then 


n  ,  n  , 

l  -L  .  I  J_ 

1-1  r1  1-1  77^ 


In  view  of  Theorem  2,  it  is  reasonable  to 
conjecture  that  when  o ( A)  <  n. 


where  the  suirmation  is  over  non-zero  r^'s  and 
s^'s  .  It  turns  out  that  this  is  false,  as  shown 
by  the  following  counter  example. 

Example  2.  Consider 

"1  I  O' 

A  -  0  0  1 

_0  0  0  j 

It  is  easy  to  show  that  r^  *  0,  r?  «  0,  r3  «  I 
while  S|  *  ST ,  *  1,  s3  *  0. 

Clearly,  the  above  conjecture  is  not  satisfied. 

The  case  of  noisy  matrices  will  be  considered 

next. 

III.  THE  MAIN  THEOREM 

Let  us  now  consider  the  case  when  a  matrix 

A  t  Rmxn,  with  m  >_  n,  corrupted  by  a  noise 
matrix  V,  i.e.,  - 


The  problem  is. to  determine  the  rank  of  A  from 
the  measured  A  .  Assume  that 

1)  EtVgjI  -0 

2)  EN-.v.,]  *  0  if  i  *  k,  j  »  1 

J  *  0  if  otherwise 

where  E(x]  is  the  expectation  of  x  .  Then, 
ia)  EW]  «  0  (6) 

2a)  E(VTV]  »  mo2I  (7) 

where  m  is  the  number  of  rows  of  V  .  From  (5), 
ATA  *  ATA  ♦  ATV  +  vrA  +  VTV  (8) 

Let  the  elements  of  A  be  bounded,  i.e., 

1*1 j I  i  K  for  some  K  ^  can  be  shown  that 

when  m  — «  -  , 

i  ATV  -  0 

Ill 

i  '/TA  *  0 
m 


IvTV.o2I 

m 

Therefore,  (8)  reduces  to 

ATA  .  ATA  *  mo2I  (9) 

Consequently,  the  singular  values  s  of  A  is 
related  to  the  singular  values  s  of  A  by 

s,2  ■  s,2  +  mo2  (10) 

for  1  ■  1,  2,  ....  n.  This  shows  that  if  sP  *  0 


L 


then  sr  is  near-zero  if  the  noise  is  small.  We  mn(J2  ,  j 
want  to  derive  a  similar  relation  for  the  residual 

numbers.  .  ®as 

of  the  t 

Theorem  3.  compute 

r  •  (A)  -  ft  (A)  ♦  mo2*  ?  (-l^WHa^Af  A*)^  Oef 

i  1  jSl  liii  numenca 

(U) 

In  addition,  the  third  term  can  be  estimated  by  I1 ; 

™o2l  !A*a,|  l2  >  j  (-l)j+1(mo2)jatT(At  ApJa1  I  1*^1  I2 

j*l  - 

where  ar  *  and  sr  is  the  smallest  non-  r 

zero  singular  value  of  A  .  (11 

As  a  consequence,  we  have  Corrollary  3.1. 

Corollary  3.1.  - “ - 

r .  (A)  *  mo2  *  mo2|  I  A*af  1 1 2  >.  ri  (A)  >  rf(A)  +  mo2  *  j~-|  iA*  ai  I 


mna2  *  10  ,  which  is  a  small  number. 

Based  on  Theorem  6,  we  will  give  a  definition 
of  the  numerical  rank  of  A  and  an  algorithm  to 
compute  it. 

Definition  3.  An  mxn  matrix  A  has  a 
numerical  rank  (a,r)  if 


(i)  there  exists  an  mxr  submatrix  B  such 


r.(B)  >  mro* 


1.  2,  ....  r. 


(ii)  there  is  no  mx(r-rl)  submatrix  8  such 
that 


Note  that  when  a  —  0,  both  sides  of  the 
r 

bounds  are  the  same,  and  we  have  an  equality.  Next 


we  need  an  estimation  of  !  |A* 
given  by  the  next  two  theorems. 


Theorem  4. 


! lA  | | 2  »  1 


1  { lAta1 1 


This  is 


if  a1  t  R(Ai) 


if  a.  e  R(A.) 


where  r  is  the  rank  of  A  . 

Theorem  5.  If  o(A)  «  r  <  n,  then  there 
exists  a^  g  R(A^ )  such  that 


|2<  ^ 
1  —  n-r 


Combining  Corollary  3.1  and  Theorem  5,  we 
obtain  the  main  theorem. 

Theorem  6.  If 

-  2 

r ^ (A )  >  mno 

for  i  *  1,  2,  ....  n,  then  ©(A)  ■  n  . 

Not#  that  mno2  may  not  be  large  even  If 
m  Is  large.  For  example,  if  we  are  interested 
in  the  effect  on  the  rank  of  A  due  to  the  round¬ 
off  error  from  a  computer  operation.  Typically, 

o  *  10"®  .  Therefore,  even  if  m  »  n  »  100. 


r . (B)  >  m(r+l)o2  ,  i  *  1,  2 . (r+1). 

The  above  definition  is  an  extension  of 
Definition  2  from  the  noise-free  case  to  the  noisy 
case.  An  algorithm  to  obtain  the  numerical  rank 
of  a  given  matrix  will  now  be  presented. 

Alqori thm. 

Step  0.  let  B  -  A  ,  r  -  n. 

Step  1.  Compute  r^(8)  for  i  *  1,  2,  ....  r. 

Step  2.  Choose  the  smallest  of  r.(B),  say 
r  (B). 

Step  3.  If  rs ( 8 )  <_  mno  and  r  »_  1,  then 
B  -  B  ,  r  *  r-1,  go  to  Step  1. 
Otherwise,  numerical  rank  *  r,  stop. 

It  can  be  shown  that  the  above  algorithm 
yields  the  numerical  rank  (a,  r)  of  Definition  3. 

Let  us  now  go  back  to  Example  1.  It  can  be 
shown  that  the  residual  numbers  of  A(n)  are 
(0,  0,  ....  0).  When  any  column  is  deleted,  the 
residual  numbers  of  the  remaining  nx(n-l) 
matrix  are  (>j,  ....  >j),  which  are  independent 

of  n  .  Since 

>s  >  n(n-l)o2 

if  a  Is  small  enough,  it  can  be  concluded  from 
Definition  3  that  the  numerical  rank  of  A(n)  is 
(a,  n-i).  It  can  be  further  shown  that  when  i 
columns  are  deleted  from  A(n),  the  residual 
numbers  of  the  remaining  nx(n-i)  submatrix  are 

the  same  and  equal  to  yyj-  >  *j  .  Therefore, 

all  such  submatrices  are  of  full  rank  as  they 
should  be. 


The  differences  between  the  residual  numbers 
and  the  singular  values  can  be  seen  from  (4). 

?  -L  .  ?  -I, 

i«l  ri  i«l  Sj 

Usually,  the  residual  numbers  do  not  differ  very 
much  from  each  other,  i.e., 

r.  .  r. 

On  the  other  hand,  when  the  numerical  rank  is 
n  -  1,  it  is  usually  that  sn_^  »  sn  . 

Therefore,  Eq.  (4)  becomes 


sn  n  ri 

i.e.,  the  smallest  singular  value  is  much  smaller 
than  the  residual  numbers  when  n  is  large.  Put 
it  into  another  way,  the  smallness  of  the  resid¬ 
ual  numbers  are  distributed  equally  among  them¬ 
selves,  while  the  smallness  of  the  singular  val¬ 
ues  are  concentrated  on  the  smallest  one.  It 
can  be  unusually  small  when  n  is  large,  and 
therefore,  it  may  not  be  accurately  served  as 
an  indicator  for  the  rank  of  a  matrix.  In  other 
words,  when  a  singular  value  is  near-zero,  there 
are  at  least  two  possibilities.  The  matrix  may 
be  near  singular,  or  the  matrix  may  be  too  large. 

In  conclusion,  the  singular-value  approach 
may  work  well  when  the  matrix  is  small,  but 
fail  when  the  matrix  is  large,  in  which  case, 
the  residual -number  approach  is  better. 
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ABSTRACT 


II.  ACCESSIBILITY  FROM  INPUT/OUTPUT  TERMINALS 


The  problem  of  fault  diagnosis  of  large-scale 
analog  circuit  is  studied.  Any  fault  diagnosis 
procedure  is  limited  by  the  number  of  circuit  param¬ 
eters  to  be  diagnosed.  When  such  limit  is  exceeded 
by  large-scale  circuits,  some  kind  of  tearing  proc¬ 
ess  has  to  be  implemented  before  a  fault  diagnosis 
procedure  can  be  applied.  In  this  paper,  a  tear¬ 
ing  process  via  accessibility  of  subnetworks  is 
presented.  The  necessary  and  sufficient  condition 
for  accessibility  is  obtained.  The  implementation 
of  this  tearing  process  is  discussed.  The  tearing 
process  can  be  applied  to  nonlinear  circuits. 


I.  INTRODUCTION 

In  the  study  of  Large-Scale  Dynamical  Systems 
(LSDS),  in  order  to  simplify  a  problem  we  often  re¬ 
duce  it  from  the  level  of  the  overall  system  to  that 
of  its  components  or  subsystems.  Tearing  or  Dia- 
koptics  [1]  is  such  an  approach  for  the  analysis  of 
large-scale  networks.  For  the  fault  diagnosis  of 
LSDS  there  is  to  technique  which  is  equivalent  to 
tearing.  Existing  methods  of  fault  diagnosis  (for 
example  [3-6])  attack  the  problem  at  the  LSDS  level. 

The  easiest  way  to  transfer  the  problem  of 
fault  diagnosis  from  the  level  of  the  overall  sys¬ 
tem  to  Chat  of  the  subsystems  is  to  have  direct 
access  to  the  Inputs  and  outputs  of  each  subsystem. 
However,  such  direct  access  may  not  be  available  to 
us.  In  such  a  case  if  we  can  determine  the  inputs 
and  outputs  of  the  components  of  interest  from  the 
LSDS  inputs  and  outputs,  we  have  effectively  ac¬ 
cessed  them.  Intuitively,  we  can  say  that  this 
would  be  possible  if  a  mapping  existed  from  the 
space  of  input-output  waveforms  of  the  LSDS  to  the 
space  of  input-output  waveforms  of  the  components. 
Such  a  map  would  be  the  basis  of  our  tearing  ap¬ 
proach.  In  this  paper,  we  explore  these  concepts 
and  determine  the  necessary  and  sufficient  condi¬ 
tions  for  the  existence  of  such  a  map  which  takes 
as  much  advantage  of  the  known  information  as  pos¬ 
sible.  We  then  lay  the  intuitive  basis  for  a 
strategy  of  tearing  which  simplifies  the  problem 
of  fault  diagnosis.  The  results  presented  are  a 
generalization  of  an  earlier  work  by  Saeks,  Singh 
and  Liu  [2]  and  Liu  and  Visvanathan  [7]. 
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The  LSDS  model  consists  of  three  parts,  the 
masked  subsystem,  the  unmasked  subsystem  and  the 
connection-box  as  shown  in  Figure  1.  The  vectors 
u  and  y  denote  the  input  and  output  vectors  of 
the  LSDS,  c  and  d  of  the  unmasked  subsystem  and  r 
and  s  of  the  masked  subsystem.  The  connection- 
box  consists  of  the  connections  between  the  above 
variables.  The  equations  for  the  three  parts  are 

a)  Unmasked  Subsystems 


We  assume  that  the  unmasked  subsystem  is  a  linear 
dynamical  system  described  by: 


x(t) 

“  A  x  (t) 

+  B  c  (t) 

d(t) 

-  C  x  (t) 

+  D  c  (t) 

(1) 


where  A,B,C,  and  D  are  constant  matrices  and  x  is 
the  state  vector  for  the  unmasked  subsystem. 


b) 

Connection 

Box 

The 

connection 

box 

is  described  by 

[10]: 
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The  L's  and  R's  are  constant  matrices.  Note  that 
d,s  and  u  are  inptfts  to  the  connection-box  and  c,r 
and  y  are  the  outputs. 

c)  Masked  Subsystem 

The  inputs  and  outputs  of  the  masked  subsys¬ 
tem  are  related  by  some  functional  form 

s  “  f  •  r  (3) 


which  is  assumed  unknown.  For  example,  the  rela¬ 
tion  (3)  could  be  a  state  equation  or  a  zero-memory 
nonlinear  function. 

Equations  (1),  (2)  and  (3)  completely  describe 
the  LSDS.  The  unmasked  subsystem  has  been  included 
in  the  LSDS  model  to  provide  us  with  a  greater 
flexibility.  Components  that  are  known  to  be  fault 
free  or  have  been  independently  diagnosied  can  be 


2 


included  as  part  of  the  unmasked  subsystem.  Ue 
further  assume  that  the  LSDS  is  well-posed,  i.e., 
the  Initial  value  solution  (x(t),  c(t),  d(t),  r(t), 
s(t))  exists  and  is  unique  for  all  admissible  in¬ 
puts  u(-). 

Definition  1. 

The  masked  subsystem  of  the  LSDS  is  said  to  be 
accessible  from  input/output  terminals,  or  simply 
accessible,  if  Vt£[0,«),  (r(c),s(t))  can  be  unique¬ 
ly  determined  from  (u(t),  y(t))  for  isfO.t]  by  use 
of  equations  (1)  and  (2)  but  not  (3),  with  the  in¬ 
itial  state  x(0)  ■  0  of  the  unmasked  subsystem.  It 
is  said  to  be  anticipatively  accessible  if  (r(t), 
s(t))  can  be  uniquely  determined  from  (u(t),y(T)) 
for  te[0,t+{]  for  some  6  >  0  but  not  for  6  ■  0. 

Theorem  1  [11] 


The  masked  subsystem  of  the  LSDS  is  accessible 
if  and  only  if  the  matrix  J 
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has  full  column-rank. 

Note  that  the  accessibility  only  depends  on 
the  memory less  part  of  LSDS.  The  application  of 
Theorem  1  to  large-scale  networks  is  considered 
next. 

III.  ACCESSIBILITY  OF  SUBNETWORKS 

The  above  result  can  be  applied  to  the  diagno¬ 
sis  of  subnetworks.  A  given  network  can  be  decom¬ 
posed  into  three  parts  as  shown  in  Figure  2.  Those 
R-elements,  which  are  known  to  be  reliable,  are 
placed  in  the  resistive-network  box.  Those  L,C- 
elements  and/or  (nonlinear)  devices  are  placed  in 
the  masked  box.  The  last  box  contains  all  the  ele¬ 
ments  to  be  diagnosed. 

Note  that  (u,y)  and  (r,s)  are  the  port-volt¬ 
ages  and  the  port-currents  of  the  overall  network 
and  of  the  subnetworks  in  the  masked  box  respect¬ 
ively. 

The  unmasked  part  has  a  state-equation  repre¬ 
sentation  (1), 


mined  by  (u,y)  from  its  reduced  resistive  network. 

IV.  ACCESSIBILITY  OF  FURTHER  REDUCED  SUBNETWORK  " 

For  the  purpose  of  testing  the  accessibility, 
it  is  possible  to  further  simplify  the  network. 

In  order  to  do  this,  more  notations  are  needed.  In] 
this  section,  m-terminal  masked  box  is  decomposed  j 
into  m-1  2-terminal  X-devices  and  the  set  of  X- 
devices-branches  is  denoted  by  G^.  Note  that  the 
knowledge  of  all  X-devices  completely  describes 
the  behavior  at  the  terminals  of  the  original  mask¬ 
ed  box.  Similarly,  the  set  of  all  inductor-, 
capacitor-branches  are  denoted  by  G^,  G^  respective 
As  for  the  resistor-branches,  since  some  of  them 
may  be  in  a  particular  tree  we  choose  in  the  net¬ 
work,  we  denote  the  set  of  them  in  the  tree  by  G^, 
and  the  rest  in  the  corresponding  cotree  by  C.. 

Now  consider  a  network  N  satisfying  the  follow¬ 
ing  assumptions. 

(1)  The  network  N  is  a  connected  graph. 

(2)  The  sources,  including  voltmeters  and  ammeters 
are  considered  as  a  set  of  branches  with 
measurable  voltages  and  currents.  The  symbol 

G  is  used  to  denote  the  set. 

s 

(3)  G^UG^,  contains  no  loop,  otherwise  ammeters  are 
inserted  to  break  the  loops. 

(4)  G  UG  contains  no  cut  set. 

d  L 

(5)  All  resistors  have  positive  resistance. 

These  five  conditions  are  assumed  to  be  satisfied  ~ 
the  networks  considered  in  this  section. 

Under  above  assumptions,  there  exists  a  tree 
(t)  which  contains  (GX,GC,GR)  and  does  not  contain 
(Gg,G^,GG).  Let  the  set  of  all  such  t  be  denoted 
by  T.  Then  for  each  t  in  T,  we  have  the  following’ 
equations:  1 


F  F  F 
SX  SC  SR 


-  F  F  F  I 

LX  LC  LR  L 


F  F  F  I 

I  GX  GC  GR  G 


x  -  Ax  +  Be 


d  -  Cx 

I  I 

where  A  ■  0,  C  *  I,  and  B  ■  diag  (C. ,  L  ), 

C  ■  (i-  ,  v,  ),d  -  (v  ,  i  ).  J 

Ci  Lj  Ci  Lj 

Associate  each  network  in  Figure  2,  construct 
a  LC-reduced  network,  or  simply  reduced  network, 
by  replacing  each  inductor  by  an  open  circuit  and 
each  capacitor  by  a  short  circuit  as  shown  in 
Figure  3.  Note  chat  the  reduced  network  is  a  re¬ 
sistive  network. 

Theorem  2.  [Ill 

The  masked  subnetwork  in  Figure  2  is  accessible 
if  and  only  if  r>s)  can  bo  uniquely  deter- 


Ohm's  Law: 


F  F  F  V 

SX  SC  SR  X 

F  F  F  V 

LX  LC  LR  C 

F  F  F  V 

GX  GC  GR  R 


VR  ■  R  *R 


XG  ’  G  VG 


where  R  and  G  are  diagonal  matrices  with  positive 


diagonal  elements. 
Define 


Then  from  Equation  (4)  and  Theorem  2,  the  following 
lemma  can  be  proved. 

Lemma  1. 

The  X-devices  in  network  N  are  accessible  if  and 
only  if  the  associated  matrix  W  has  full  column-rank. 

Although  the  size  of  matrix  W  is  smaller  than 
the  original  matrices,  that  is  only  superficial  be¬ 
cause  some  of  the  submatrices  cannot  be  determined 
without  the  knowledge  of  the  original  matrices.  The 
following  lemmas  will  provide  a  better  solution. 

Lemna  2. 

In  a  network  N,  the  associated  matrix  W  has  full 
column-rank  in  the  generic  sense  [12]  if  the  matrix 

has  full  column-rank. 

Lemma  3. 

The  matrix  F  of  a  network  N  has  full  column- 
rank  if  (1)  there  exists  a  tree  teT  such  that  G„UG_ 
contains  no  loop  in  LC-reduced  network  of  S  and 
(2)  the  matrix  W  associated  with  N  has  full  column - 
rank. 

The  second  condition  in  Lemma  3  will  be  referred 
as  Assumption  (5)  in  the  sequent  paragraphs. 

Note  that  not  only  the  size  of  the  matrix  F 
is  smaller  than  W  but  also  the  matrix  itself  can 
be  determined  by  the  subgraph  N"  constructed  by 
shorting  all  resistor-branches  in  the  particular 
tree  and  opening  all  resistor-branches  in  the  cor¬ 
responding  cotree  in  the  LC-reduced  network  N' .  We 
call  the  subgraph  N"  RLC-reduced  network.  Then 
Theorem  3  follows. 

Theorem  3. 

Let  N  be  a  network  satisfying  Assumptions  (1) 
to  (5),  then  X-devices  in  N  are  genetically  access¬ 
ible  if  and  only  if  the  matrix  F  in  RLC-reduced 
network  associated  with  t  has  full  column-rank. 

Remarks 

1.  Theorem  3  enables  us  to  determine  the  accessi¬ 
bility  of  X-devices  in  a  network  by  dealing 
with  a  considerably  smaller  subnetwork  N"  with 
only  sources  and  X-devices  in  this  subnetwork. 

2.  From  graph  theory  ,  F  ^  has  full  column-rank  if 
and  only  if  G  contains  a  tree  in  RLC-reduced 
network.  Therefore,  to  achieve  the  accessi¬ 
bility  of  X-devices,  we  only  need  to  add  some 
voltmeter-branches  into  G.  to  make  it  contain 

a  tree  in  RLC-reduced  network. 

3.  In  this  theory,  in  order  to  access  all  the  X- 
devices  in  a  network,  the  number  of  sources 
must  be  at  least  equal  to  the  number  of  X- 
devices  as  can  be  seen  from  the  required  full 
column-rank  of  F^. 

4.  The  choice  of  tree  in  a  network  is  crucial  in 
determining  the  minimal  set  of  test  points  to 
obtain  accessibility.  Altough  the  algorithm  of 


finding  the  tree  is  still  under  development,  it  is 
quite  possible  to  pick  the  tree  .in  a  network  of 
reasonable  scale  by  inspection  as  shown  in  Example 


Example  1 


Consider  the  accessibility  of  the  two  tran¬ 
sistors  in  the  two-stage  amplifier  circuit  as 
shown  in  Fig.  4a.  It  is  clear  to  see  that  (C  , 

T.  ,  C2,  T^,  C  )  form  a  loop.  From  Assumption *■( 2)  , 
the  loop  can  Be  broken  by  inserting  an  ammeter  A 
in  series  with  C,.  Then,  by  choosing  the  tree 
consisting  (X  ,  X  ,  X  X  ,  R  )  in  Fig.  4b,  the 
associated  RLC-reduced  network  is  shown  in  Fig. 

4c.  Apparently,  the  branches  (E  ,  E  ,  E  ,  A)  con¬ 
tains  a  tree  in  Fig.  4c.  Thus,  (X  ,  X  ,  X  X,), 
so  that  (T^,  T,),  are  accessible  after  the^  4 
insertion  of  tne  ammeter. 


v  V.  APPLICATION  TO  TEARING  PROCESS 

The  purpose  of  diakoptic  or  tearing  process 
[1]  is  to  find  a  way  of  partitioning  a  large-scale 
network  into  smaller  subnetworks  so  that  the  solu¬ 
tion  of  the  large-scale  network  can  be  obtained  by 
solving  the  (decoupled)  subnetworks.  Clearly,  this 
represents  a  reduction  in  computation  time. 

If  fault  diagnosis  is  of  our  interest  instead 
of  the  network  solutions,  new  tearing  process 
should  be  developed  so  that  it  is  compatible  to 
fault  diagnosis  problem.  The  accessibility  can 
fulfill  such  purpose.  Let  us  consider  the  network 
in  Figure  2.  If  each  masked  subnetwork  is  diagnos- 
able  from  its  input/output  pair  (r,s)  and  is  acces¬ 
sible,  then  the  entire  network  is  diagnosable  from 
its  input/output  pair  (u,y).  This  is  because  (r,s) 
can  be  obtained  from  (u,y)  independently  from  the 
characteristics  of  masked  subnetworks. 

In  summary,  if  accessibility  is  achieved,  one 
can  diagnose  the  entire  network  by  diagnosing  each 
of  the  (smaller  and  decoupled)  masked  subnetworks 
Individually. 
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Abstract — The  pwMw  of  *dphg  a  feedback  system  with  pwuaribud 
penpettha  it  Hacked  via  a  fractional  rcpwacatafloa  approach  (a  feedback 
ayateai  —lyils  aad  ayaiheala.  To  (kb  ead  we  let  H  demote  a  riag  of 


ratio  of  two  operator!  In  H.  This,  la  turn,  leads  to  a  *apllfli  d  test  to 
dsttnahM  whether  or  aot  a  feedback  system  la  which  that  pleat  h 
•■bedded  has  t^te  preac^^^ieti  properties  aad  a  complete  c^teracteidssdioa  of 
those  wifiuibon  which  will  “place”  the  feedback  system  la  .  The 
theory  Is  forasalaled  arloarntteally  to  permit  Its  sppHrarioa  In  a  wide 


I.  Introduction 

INTUITIVELY,  the  linear  feedback  system  design  pro¬ 
cess  may  be  broken  down  into  three  steps:  modeling, 
analysis,  and  synthesis;  each  of  which  may  be  carried  out 
via  a  multiplicity  of  time  and  frequency  domain  tech¬ 
niques.  In  engineering  practice,  however,  the  three  steps 
are  loosely  matched  to  one  another.  The  purpose  of  the 
present  paper  is  to  use  fractional  representation  models  to 
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Fig.  1.  Single-variate  control  system. 


the  analysis  and  synthesis  of  feedback  systems.  Here,  if 
one  desires  to  design  a  system  with  prescribed  properties 
tht  given  plant  is  initially  modeled  as  a  quotient  of  two 
operators,  each  of  which  has  the  desired  properties.  Once 
such  a  model  has  been  specified  a  similar  model  may  be 
formulated  for  the  feedback  system  constructed  from  that 
plant  which,  in  turn,  may  be  used  to  determine  whether  or 
not  the  feedback  system  has  the  desired  properties.  More¬ 
over,  the  set  of  compensators  which  will  cause  the  feed¬ 
back  system  to  have  the  prescribed  properties  may  be 
completely  characterized  in  terms  of  such  a  model.  As 
such,  by  choosing  a  model  for  the  plant  which  is  matched 
to  the  design  criteria  the  analysis  and  synthesis  processes 
for  a  feedback  system  may  be  greatly  simplified. 

These  ideas  are  illustrated  by  the  following  derivation 
of  the  set  of  stabilizing  compensators  for  the  single  variate 
control  system  of  Fig.  1. 

We  say  that  a  transfer  function  p(s)  is  exponentially 
stable  (exp.  stable)  if  p(s)  is  a  proper  rational  function  with 
poles  having  negative  real  parts.  Although  the  plant  may 
naturally  be  modeled  as  a  quotient  of  coprime  polynomi¬ 
als  [16],  [19]  p(s)m<x(s)/b(s)  since  our  ultimate  goal  is  a 
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stable  system  we  prefer  to  model  p(s)  as  a  quotient  of  exp. 
stable  rational  functions 

p(s)  -  n(s)/d(s)  -  [  a(s)/m(s)  ]  [  b(s)/ m(s)  ]  ~ '  (1.1) 

where  m(s)  is  strictly  Hurwitz  polynomial  of  degree  equal 
to  the  degree  of  b(s).  Moreover,  since  a(s)  and  b(s)  are 
coprime,  the  rational  functions  n(s)  and  d(s)  are  coprime 
in  the  sense  that  there  exist  exp.  stable  rational  functions 
u(s)  and  v(s)  such  that 

u(s)n(s)  +  v(s)d(  s)  -  1 .  (1.2) 

Similarly,  we  assume  that  our  compensator  is  modeled  as 
a  quotient  of  exp.  stable  rational  functions,  c(s)  - 
x(s)/y(s),  which  are  coprime  in  the  above  sense.  Now,  a 
little  algebra  will  reveal  that  the  closed-loop  system  trans¬ 
fer  function  from  input  u  to  output  y  is  given  by  a  ratio  of 
exp.  stable  rational  functions  in  the  form 

*  "W [  y(s)d(s)  +  *(*)*(*) ]  ~  ( 1  -3) 

Moreover,  it  can  be  shown1  that  /^(s)  will  be  stable  if  and 
only  if 

[y(j)</(j)  +  x(j)/i(s)]-*(s)  (1.4) 

has  an  exp.  stable  inverse.  Since  k(s)  is,  itself,  exp.  stable 
this  implies  that  the  feedback  system  will  be  exp.  stable  if 
and  only  if  k(s)  is  nonzero  for  all  Res>0,  including  oo. 
An  exp.  stable  function  with  these  properties  is  called 
miniphase.  As  such,  the  problem  of  synthesizing  an  exp. 
stable  feedback  system  reduces  to  the  solution  of  (1.4)  for 
exp.  stable  rational  functions  x(s)  and  y(s)  given  exp. 
stable  functions  n(s)  and  d(s)  and  a  miniphase  function 
*(•»)• 

By  direct  substitution  one  may  verify  that 

y*(j)-r(j)/i(j)  and  **($)- -r(s)d(s)  (1.5) 

satisfy  the  homogeneous  equation 

y*(s)d(s)+x*(s)n(s)-0  (1.6) 

for  all  exp.  stable  rational  functions  r(s).  Moreover,  since 
n( j)  and  d(s)  are  coprime  it  follows  that  all  exp.  stable 
rational  solutions  of  (1.6)  are  of  this  form  [15],  [18].  On  the 
other  hand,  a  particular  solution  of  (1.4)  may  be  obtained 
by  multiplying  (1.2)  by  k(s),  which  yields 

yf(s)<"k(s)v(s)  and  x'(j)-1c(j)m(j).  (1.7) 

As  such,  if  we  let  r(s)  vary  over  the  set  of  exp.  stable 
rational  functions  and  k(s)  vary  over  the  set  of  miniphase 
functions  we  obtain  a  complete  parameterization  of  the 
stabilizing  compensators  for  our  feedback  system  in  the 

'See  Um  axiomatic  derivation  of  Section  III  for  the  details. 


form 

x(s)  _  [ k(s)u(s)-r(s)d(s )] 

A*)  [k(s)v(s)  +  r(s)n(s)J 

[u(s)-w(s)d(s)] 

[t>(s)+w(j)/i(s)] 

where  w(s)  —  r(s)/ k(s)  ranges  over  the  exp.  stable  rational 
functions. 

A  comparison  of  (1.8)  with  the  class  of  stabilizing 
compensators  derived  by  Youla,  Bongiomo,  and  Jabr 
[24],  [25],  [29]  will  reveal  that  the  two  results  differ  only  in 
that  our  u(s),  v(s),  n(s),  and  d(s)  are  exp.  stable  rational 
functions  while  theirs  are  polynomials.2  Unlike  their 
analytic  derivation,  however,  the  above  result  was  ob¬ 
tained  via  elementary  algebraic  operations.  Indeed,  the 
only  properties  of  the  exp.  stable  rational  functions  em¬ 
ployed  are  their  closure  under  addition  and  multiplication 
together  with  the  fact  that  the  identity  is  an  exp.  stable 
rational  function,  i.e.,  the  exp.  stable  rational  functions 
form  a  ring  with  identity.  As  such,  if  the  exp.  stable 
rational  functions  of  the  above  derivation  were  to  be 
replaced  by  any  prescribed  ring  of  single-input  single-out¬ 
put  systems,  (1.8)  would  yield  a  complete  characterization 
of  the  compensators  which  would  “place”  the  feedback 
system  in  that  ring.  If  one  works  with  a  ring  of  rational 
functions  with  poles  in  a  prescribed  region  a  solution  of 
the  pole  placement  problem  is  obtained  [18],  whereas,  if 
one  chooses  to  work  with  stable  transcendental  functions 
a  solution  to  the  stabilization  problem  for  distributed 
systems  is  obtained  [7],  [8]  etc.  Indeed,  with  minor  modifi¬ 
cations  the  derivation  can  be  extended  to  noncommuta- 
tive  rings  thereby  including  multivariate  and  time-varying 
systems.  In  each  case,  a  simple  solution  to  a  fundamental 
problem  of  feedback  system  design  is  obtained  by  virtue 
of  choosing  a  model  for  the  given  plant  which  is  matched 
to  the  ultimate  goal  of  the  design  problem.  In  particular,  if 
we  desire  to  design  a  feedback  system  which  lies  in  a 
prescribed  ring  of  operators  we  model  the  plant  as  a 
quotient  of  operators  from  that  ring. 

Consistent  with  the  above  philosophy  the  following 
section  of  the  paper  is  devoted  to  the  formulation  of  an 
axiomatic  theory  of  fractional  system  representation. 
Here,  a  given  system  is  modeled  as  a  quotient  of  two 
operators  lying  in  a  prescribed  ring  H.  The  corresponding 
feedback  system  analysis  and  synthesis  problems  are  then 
studied  in  the  succeeding  sections.  In  particular,  Section 

III  is  devoted  to  the  problem  of  determining  whether  or 
not  a  feedback  system  lies  in  H  given  that  its  plant  is 
represented  as  a  quotient  of  systems  from  H  while  Section 

IV  is  devoted  to  the  problem  of  characterizing  those 
compensators  which  will  “place”  the  feedback  system  in 
H.  The  resultant  axiomatic  theory  of  feedback  system 
design  is  applicable  to  multivariate,  time-varying,  distrib- 

sFrom  •  computational  point  of  view,  it  is  more  convenient  to  repre¬ 
sent  rational  functions  as  ratios  of  polynomials,  as  per  Youla  tt  at. 


utcd,  and  some  multidimensional  systems  and  includes 
the  stabilization,  pole  placement,  and  feedforward  design 
problems.  Several  of  these  applications  are  illustrated  by 
the  examples  of  Section  V.  In  the  final  section  of  the 
paper  a  partial  generalization  of  the  theory  to  nonlinear 
systems  is  described.  This  follows  the  algebraic  pattern 
established  in  the  linear  case  but  is  formulated  in  terms  of 
a  left-distributive  ring  to  model  the  properties  of  a  nonlin¬ 
ear  system  [23]. 

II.  Axiomatic  Theory 

Table  I  displays  several  examples  of  the  axiomatic 
system  developed  below.  Reference  to  it  will  help  in 
visualizing  the  breadth  and  significance  of  the  theory. 
Additional  examples  also  appear  in  Section  V. 

Let  G  be  a  (not  necessarily  commutative)  ring  with 
identity  and  let  H  be  a  subring  of  G  which  includes  the 
identity.  The  feedback  system  and  its  subsystems  will  be 
represented  by  operators  which  are  elements  of  G.  The 
compensator  will  be  chosen  so  that  the  overall  system  will 
be  represented  by  an  operator  in  the  subring  H. 

We  define  two  multiplicative  subsets  [2],  [27]  of  H, 

I-{hEH\h~xEG),  (2.1) 

i.e.,  /  is  the  set  of  elements  of  H  which  have  an  inverse  in 

G ; 

J-[hEH\h~xEH),  (2.2) 


elements  of  H.  Note  that 

J  Cl CH  cG.  (2.3) 

Given  the  above  structure  we  say  that  a  system  g  E  G 
has  a  right  fractional  representation  in  {G,H,I,J}  if  there 
exist  nrEH  and  drEl  such  that  g~nrd~x.  Furthermore, 
we  say  that  the  pair  (nr,dr)EH  X  H  is  right  coprime  if 
there  exist  ur  and  o,  in  H  such  that 

urnr  +  vrdr  =  l.  (2.4) 

The  right  fractional  representation  nrd~ 1  in  {G,H,I,J}  is 
said  to  be  right  coprime  if  the  pair  (nr,dr)  is  right  coprime. 

The  relationship  between  our  concept  of  coprimeness 
and  the  usual  common  factor  criterion  for  coprimeness 
[28]  is  given  by  the  following  properties. 

Property  1:  Let  the  pair  (nr, dr)EH  X  H  be  right 
coprime.  Let  nr  and  dr  have  a  common  right  factor  rEH, 
i.e.,  n,  -  x,r,  dr  - yrr  for  some  xrEH  and  yr  E  H.  Then  r 
has  a  left  inverse  in  H. 

Proof:  Substitute  the  assumed  factorizations  of  nr 
and  dr  into  (2.4)  and  obtain 

urnr  +  c,dr"(«r^  +  »^r)'’“l-  (2.5) 

Since  H  is  a  ring,  urxr  +  v^yr  E  H.  From  (2.5)  it  follows 
that  r~L—  urxr  +  vryr  is  a  left-inverse  of  r.  ■ 

Property  2:  Let  g  —  nrd~x  be  a  right  coprime  fractional 
representation  of  g  in  {G,H,I,J}.  Let  be  a 

second  (not  necessarily  coprime)  right  fractional  repre¬ 
sentation  of  g  in  { G,H,I,J }.  Then  there  exists  wrinW 
such  that 


i.e.,  J  is  the  subgroup  of  H  consisting  of  all  invertible 


x,-V  and  y,-d,r. 


(2.6) 
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Proof:  Given  the  two  factorizations  of  g,  let  /■- 
d~  V,;  hence  rSG.  Then 

y,-d,r  (2.7) 

and,  performing  calculations  in  the  ring  G,  we  obtain 

-  gyr  ~(nA~  l)y,  -  nr(dr~  V,)  -  n,r.  (2.8) 

From  (2.7)  and  (2.8),  r  is  a  common  right  factor  of  x,  and 
yr.  To  show  that  r&H,  consider 

r -  V,  -  ( u,n,  +  vrdr)d~  xyr  -  urnrd~  'yr  +  vy, 

-  urgyr  +  vry,  -  u,x, +  v,yrEH  (2.9) 

where  we  used  the  equality  g  —  x,yf 1  »•  nrd~ 1  to  derive 
(2.9).  ■ 

Although  G  is,  in  general,  a  noncommutative  ring,  the 
entire  theory  developed  above  for  right  fractional  repre¬ 
sentations  can  be  replicated  for  left  fractional  representa¬ 
tions.  In  particular,  we  say  that  gE  G  has  a  left  fractional 
representation  in  {G,H,I,J}  if  there  exist  n,EH  and  d,El 
such  that  g^d,~ln,.  Furthermore  we  say  that  the  pair 
( n„d,)EH  X  H  is  left  coprime  if  there  exist  u,  and  o,  in  H 
such  that 

n,U'  +  d,v,  •  1 .  (2.10) 

The  left  fractional  representation  d,~  xn,  is  said  to  be  left 
coprime  if  the  pair  (n,,d,)  is  left  coprime.  With  these 
definitions  the  existence  of  a  common  left  factor  for  &  left 
fractional  representations  of  g  is  characterized  by  the 
following  properties. 

Property  1‘:  Let  the  pair  (n,,d,)  be  left  coprime.  Let  n, 
and  d,  have  a  common  left  factor  /  in  H,  i.e.,  nt  —  lx,, 
d,^fy,  for  some  x,EH  and  y,EH.  Then  l  has  a  right 
inverse  EH. 

Property  2’:  Let  gmd,~xn,  be  a  left  coprime  fractional 
representation  of  g  in  {G,H,I,J}.  Let  g - yf  'x,  be  a 
second  (not  necessarily  coprime)  left  fractional  repre¬ 
sentation  of  g  in  { G,H,I,J }.  Then  there  exists  an  /  in  H 
such  that 

x,  -  In,  and  ytmld,.  (2.11) 

The  above  properties  of  a  coprime  fractional  repre¬ 
sentation  have  all  been  derived  under  the  assumption  that 
such  a  representation  exists.  Of  course,  if  G  denotes  the 
rational  matrices  and  H  denotes  the  polynomial  matrices 
the  existence  f  a  coprime  representation  is  implied  by 
classical  analysis  [16],  [19].  Indeed,  the  classical  analysis 
readily  extends  to  the  case  where  H  is  taken  to  be  the  exp. 
stable  rational  matrices  or  the  ring  of  proper  rational 
matrices  with  poles  in  a  prescribed  region  [18].  On  the 
other  hand  for  multidimensional  [26],  distributed  [4],  [8], 
and  time-varying  systems  [11],  [15]  there  is  no  assurance 
that  an  arbitrary  gEG  will  admit  a  fractional  representa¬ 
tion  nor  even  that  the  set  of  gEG  which  admit  such  a 
representation  will  be  a  linear  space.  Moreover,  all  g's 
which  admit  a  fractional  representation  may  not  admit  a 
coprime  fractional  representation  [26].  In  general,  the  set 


of  gEG  which  admit  a  fractional  representation  in 
{ G,H,I,J )  will  form  a  subring  of  G  if  and  only  if  the  Ore 
condition3  is  satisfied  while  criteria  for  coprimeness  have 
been  formulated  in  various  special  cases  though  no  gen¬ 
eral  theory  exists  [  l  ],  [4],  [26].  The  standard  condition  for 
the  existence  of  fractional  representations  which  are 
coprime  in  the  sense  of  (2.4)  is  that  H  be  a  right  principal 
ideal  domain. 

Reference  to  Table  I  shows  that  in  applications  it  is 
important  to  have  conditions  under  which  g  will  be  in  H 
and  these  conditions  should  be  expressed  in  terms  of  its 
fractional  representation. 

Property  3:  Let  g  »  nrd~ 1  with  nrEH  and  dr  £  I. 

a)  If  dr  E  J,  then  gE  H. 

b)  If  g  *  nrd~ 1  is  a  right  coprime  fractional  representa¬ 
tion  of  g  in  { G,H,I,J ),  then  gEH  implies  that  drEJ. 

Proof: 

a)  We  have  drEJ\  hence  by  (2.2),  d~]EH  and  thus 
nrdr~l-g<=H. 

b)  We  have  gEH.  Furthermore,  n, * gd„  dr**\dr  im¬ 

plies  that  dr  is  a  right  common  factor  of  nr  and  dr;  hence 
by  Property  1,  dr  has  a  left  inverse  in  H.  But  drEl  by 
assumption,  so  d~ 1  exists  and  is  an  element  of  G;  thus 
d~x~d~LEH;  hence,  by  (2.2),  dr  E  J.  ■ 

Property  3  ’ :  Let  g**d,~  'n,  with  n,EH  and  dt  £  /. 

a)  If  d,EJ,  then  gEH. 

b)  If  g~d,~xn,  is  a  left  coprime  fractional  representa¬ 
tion  of  g  in  { G,H,I,J ),  then  gEH  implies  that  dtEJ. 

Property  4:  Let  g  *  nrd  ~  xn,  where  nr,  n,  E  H.  and  d  El. 

a)  If  dEJ,  then  gEH. 

b)  Let,  in  addition,  nrd~x  be  a  right  coprime  frac¬ 
tional  representation  in  { G,H,I,J }  and  d~'n,  be  a  left 
coprime  fractional  representation  in  { G,H,1,J };  then  gE 
H  implies  that  dEJ. 

Proof: 

a)  By  assumption,  dEJ;  hence  d~xEH.  So  g- 
nrd~'n,EH. 

b)  Since  d  ~  'n,  is  a  left  coprime  fractional  representa¬ 
tion  there  exist  u,,  v,EH  such  that 

n,u,  +  dv,  —  \,  (2.12) 

thus, 

nrd  ~ 1  -  nrd  ~  x(n,u,  +  do,)  —  nrd~  'n,u,  +  nru,->gu,  +  nro,. 

(2.13) 

Now  gEH  hence  (2.13)  gives  nrd~xEH.  By  Property  3, 
nrd~xEH  together  with  the  fact  that  the  pair  (nr,d)  is 
right  coprime  implies  dEJ.  ■ 

III.  Analysis 

To  start  with  consider  the  feedback  system  of  Fig.  2. 
Suppose  that  the  plant  is  described  by  a  right  coprime 
fractional  representation  p ~nrd~x  in  {G,H,I.J}.  The 

*[G,H,I,J)  satisfies  the  Ore  condition  for  right  fractional  representa¬ 
tions  if,  whenever  f€<7  admits  a  left  fractional  representation  it  also 
admits  a  right  fractional  representation  and  vice  versa.2 


desobr  tt  at.:  feedback  system  design 
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Fig.  2.  Unity  gain  negative  feedback  system. 


Fig.  3.  Feedback  system  with  plant  and  compensator. 


closed-loop  dynamics  of  2p  are  described  by  the  maps 


To  describe  the  feedback  system  2  we  consider  the  map 
«2>-^(e,,e2).  Simple  calculations  give 

(1+pe)-1  -p(\  +  cp)~l 

c{\+pc)~'  (\  +  cp)-' 

(3.7) 


Now  let  hjm:(ul,u1fi-*(yl,y^.  Using  the  summing  node 
equations  it  is  easy  to  see  that 

^ and  h^X-Kh^  (3.8) 
where  K  is  the  symplectic  matrix 


0  1 


-1  0  J 


(3.9) 


hn:  uw*e  ; 

+py'-dr(dr+n,y' 

(3.1) 

h^  :  ut-*y  ; 

hyu  ~p( \+p)~ '  -  nr(dr  +  n,y\ 

(3.2) 

Note  that 

^eu  ™  ^  * 

(3.3) 

We  say  that  2p  is  well  defined  in  G,  (//,  respectively),  if 
h„EG,  (H,  respectively). 

Note  that  the  pairs  (nr,dr  +  nr)  and  (dr,  dr  +  nr)  are  right 
coprime;  indeed,  the  right  coprimeness  of  (nr,dr)  implies 
(2.4),  hence 

(uf-t>,)nr  + «,(</,  + /t,)»l  (3.4) 

while 


It  is  well  known  that  in  the  case  of  multivariable  rational 
matrices,  one  has  to  consider  the  four  submatrices  of  h \m 
in  (3.8)  because  examples  show  that  any  one  of  the 
submatrices  may  be  unstable  while  the  remaining  ones  are 
stable.  (For  detailed  examples,  see  [30].)  Let  us  calculate 

"  ( 1  +PC)~  '  “  1  ~PC(  1  +PC)~  ' 

-l-p(l+cp)'‘c 

- 1  -p[yr\yA + x,nr)dr~ 1  ]  " 1  c 

“  1  ~  nr(ytdr  +  x,nr)  ~ 1  x,  (3.10) 

mdr(yidr  +  x,nr)~'x,  (3.11) 


(vr-u,)d,+ur(d,  +  n,)-l.  (3.5) 

Theorem  1:  Consider  the  feedback  system  2p  of  Fig.  2. 

a)  Let  p m nrd~ 1  be  a  fractional  representation  in 
{  G,H,I,J }  of  the  element  pEG;  then  2p  is  well  defined  in 
G  if  and  only  if  d,  +  nrE  1. 

b)  Let  p «■  n  d~ 1  be  a  right  coprime  fractional  repre¬ 
sentation  in  {G, //,/,/}  of  the  element  pEG;  then  2p  is 
well  defined  in  H  if  and  only  if  dr  +  nrEJ. 

Proof:  a)  =».  hn  E  G  and  dr  E  /  imply 

dr  V -  dr- ‘( 1  +p) - 1 -  dr-  'dr{dr  +  nry'-(dr+nrylGG. 

(3.6) 

Now  cfElcH  and  nrEH,  so  dr  +  nrE H.  This  together 
with  (3.6)  implies  d,  +  nrE  /. 

a)  «■.  dr  +  n,  El  implies  (dr  +  nr)~]  EG;  hence  hn - 
d,(dr  +  nr)~'  EG. 

b)  Follows  from  Property  3,  together  with  (3.4)  and 

(3.5).  ■ 

Of  course,  a  similar  theorem  holds  for  left  factoriza¬ 
tions. 

We  now  consider  the  feedback  system  2  of  Fig.  3  where 
the  plant  p  is  preceded  by  a  compensator  c;  p  and  c 
belong  to  G  and  are  specified  by  their  coprime  fractional 
representation  in  (G,H,I,J)  “d  y,~ lx,,  respec¬ 

tively. 


+  cp)~ '  "0  +yt~ ' x,nrd~ ')" ' 
-[yrl(yidr+x,nr)d-']~l 
mdr(y&  +  x,nrylyl  (3.12) 

+  -nt(y,dr^x,nryxy,.  (3.13) 

We  say  that  2  is  well  defined  in  G,  (H,  respectively)  if  and 
only  if  each  entry  of  hn  defined  in  (3.8)  belongs  to  G,  (H, 
respectively). 

Theorem  2:  Consider  the  feedback  system  2  of  Fig.  3. 
Let  nrd~ 1  and  y,~  be  a  right  and  left  fractional  repre¬ 
sentations  of  p  and  c  in  (G, H,I,J). 

a)  If  y,dr  +  x,nr  e  /,  then  2  is  well  defined  in  G. 

b)  If  y,dr  +  xtnrEJ,  then  2  is  will  defined  in  H. 

c)  If  htiUt  E  G,  then  y,dr  +  x,nr  e  /  hence  if  2  is  well 
defined  in  G,  then  y,dr  +  x,nr  E  /. 

d)  Assume,  in  addition,  that  nr(y,dr)~ 1  and  (y,dr)~'x, 
are  right  coprime  and  left  coprime  fractional  representa¬ 
tion,  respectively;  then  htiU  EH  implies  that  y,dr  +  x,nrE 
J,  and  hence,  if  2  is  well  defined  in  H,  then y,dr  +  x,nrEJ . 

Proof:  a)  and  b).  If  y,dr  +  x,nr  E  /,  ( J ,  respectively), 
then  by  the  definition  (2.1)  of  /,  [(2.2)  of  J,  respectively], 
the  formulas  (3.10)— (3.13),  and  the  closure  of  the  ring  G, 
(H,  respectively),  the  conclusion  follows. 
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c)  If  h'^EG,  then  so  is  d~ \^y{~1  since  drei  and 
y,  S I.  Now, 

4"  1  -  d~  ‘(1  +  cp) _  v 1 

-  d~ '( I  +y,'  xx,nrd~ ')  “  V 1 
*  dr '  [  >r'(M  +  x,nr)d~ 1  ]  y,~ 1  -  (y,dr  +  x,nr)  ~ 1 

(3.14) 

hence  the  fact  that  h,^  £  G  implies  that  (y,d,  +  x,nr)~x  e 
G  and  thus  ( y,dr  +  x,nr)  £  /. 

d)  First  we  prove  that  the  pair  ( nr,y,dr  +  x,nr)  is  right 
coprime.  Since  ( nr,y,dr )  is  right  coprime,  there  exists  ur 
and  6reH  such  that 

urnr  +  vry,dr  -  1 ;  (3.15) 

hence 

(“,  “  orx,)nr  +  vr(y,dr  +  x,nr)  -  1  (3.16) 

and  the  claim  is  established.  Similarly,  we  show  that 
(y,dr  +  x,nr,x,)  is  left  coprime.  Now  consider 

h.,u,  “  1  -  nr(y,dr  +  X,nr)  ~'x,.  (3.17) 

By  assumption,  A,  Wi  €  H\  then  the  special  assumption  of 
d)  and  Property  4  imply  that  y,dr  +  x,nrej.  This  com¬ 
pletes  the  proof.  ■ 

Note,  the  special  assumptions  used  in  d)  to  the  effect 
that  nr(y,dr)~l  is  right  coprime  and  (y,dr)~lx,  is  left 
coprime,  imply,  in  some  sense,  that  p  and  c  have  no 
common  factors.  More  precisely,  since  J  serves  as  the 
group  of  units  in  our  theory  these  conditions  imply  that 
any  common  factors  of  p  and  c  must  lie  in  J. 

IV.  Design 

Consistent  with  our  approach  of  matching  the  plant 
model  to  the  goal  of  the  given  feedback  system  design 
problem  the  present  section  is  devoted  to  the  problem  of 
characterizing  the  set  of  compensators  which  will  “place” 
a  feedback  system  in  a  prescribed  ring  H  given  that  both 
the  plant  and  compensator  are  modeled  by  fractional 
representations  in  {G,H, /,/,}. 

Theorem  3:  For  the  feedback  system  2  of  Fig.  3,  let  the 
plant  p  have  a  right  coprime  and  a  left  coprime  fractional 
representation p  —  nrd~ 1  —  d,~ xn,  in  {G, //,/,/).  Let  u,  and 
c,  both  in  H  be  such  that  (2.4)  holds.  Then  for  any  weH 
such  that  mot, +  o,6/,  the  compensator 

c  »  (wn,  +  o,)  - '( -wd,+  ur)  6  G  (4.1) 

results  in  a  feedback  system  2  well  defined  in  H.  For  such 
a  compensator,  h„  €  H2x2  and 

h  .[  1-M -*'<*/  +  «»•)  -«,(*w»/  +  t>,) 

“  [<(-W,  +  «,)  d,(wn,  +  u,)J-  K  ) 

Conversely,  if  2  is  well  defined  in  H  and  if  the  compensa¬ 
tor  c  -y,~  xx,  is  such  that  ( nr,y,d, )  and  (y,dr,x,)  are  right 


coprime  and  left  coprime  respectively,  then  c  is  given  by 
expression  (4.1). 

Proof- 

Step  1:  Choose  any  kEJ,  (hence  k~lGH),  and 
solve  for  y,  and  x,G  H  the  equation 

y,dF  +  x,nr-k.  (4.3) 

Observe  that  if  {yhx,)  is  any  solution  in  H  of  (4.3),  then 

k~'(y,dr)  +  k-'(x,nr)- 1  (4.4) 

and 

(M)*  _l  +  (xtnr)k  “'-I,  (4.5) 

hence,  ( nr,y,dr )  is  right  coprime  and  (y,dr%xt)  is  left 
coprime.  Thus,  the  assumptions  of  Theorem  2,  part  d) 
holds  for  any  solution  of  (4.3). 

Step  2:  Obtain  all  solutions  of  the  homogeneous 
equation 

yfo  +  xfn,-  0.  (4.6) 

Since  p  —  nrdr~ 1  -  d,~  Vi,,  direct  calculation  shows  that  for 
any  rE  H, 

_y  *  a*  rn,  x^-rd,  (4.7) 

are  solutions  of  (4.6). 

It  remains  to  show  that  all  solutions  of  (4.6)  are  of  the 
form  (4.7);  so  we  assume  that  y*  and  x,h  £  H  and  satisfy 
(4.6).  Let  r—  -  x,V,~ hence 

x?-rd,.  (4.8) 

Now  using  (4.6) 

y,k •yfdrd~ 1  -  -  x,\d~ 1  -  -  xfp 

-  -  x,hdr  'n,  -  rn,.  (4.9) 

Equations  (4.8)  and  (4.9)  show  that  any  solution  of  (4.6) 
has  the  form  of  (4.7);  it  remains,  however,  to  show  that 
rSH, 

r~  ~  x,hdr 1  -  -  x,Hdr  l(d,v,  +  n,u,) 

-  -  xj'v,  -  xkd,~  'n,u,  -  -  xfv,  +  yfu,  G  H.  (4. 10) 

Step  3:  Obtain  a  particular  solution  of  (4.3).  From 
the  right  coprimeness  condition  for  (nr,dr), 

kvrdr  +  kurnr  “  k  (4. 1 1 ) 

hence 

yf m  kvr,  xf  »  kur.  (4.12) 

Hence  any  solution  of  (4.3)  is  of  the  form 
y,^m,  +  kvr 

x, ■  -  rd,  +  kut  for  some  reH  (4. 1 3) 

and  for  any  such  solution  ( nr.y,dr )  is  right  coprime  and 
(y,dr,xt)  is  left  coprime. 
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Step  4:  Consider  the  condition 
rEH  and  kEJ  such  that  rn,  +  kvr E /  (4.14) 
or  equivalently,  if  we  set  w  —  k~'rEH, 

wEH  such  that  wn,  +  v, E /.  (4.15) 

If  (4.15)  holds, 

c^(wn,  +  vr)~l(  —  wd,+  u,)E  G  (4.16) 

is  a  compensator  in  G  which  can  also  be  written  as  [see 
(4.13)1 

c  -  («!,  +  *»,)  ~'(  -rd,  +  kur).  (4.17) 

If  we  let  y,  *  rn,  +  kvr  and  x,  —  —  rd,  +  kur,  then,  by  (4.17), 
c—  y,~{x  and,  by  calculation,  we  verify  that  (4.3)  holds. 
Thus  for  any  such  compensator,  by  Theorem  2,  the  feed¬ 
back  system  2  is  well  defined  in  H. 

Step  5:  Conversely  consider  a  feedback  system 
well  defined  in  H  with  a  compensator  c  ~y,~  lx,  such  that 
(nr,y,dr)  and  (y,dr,x,)  are  right  coprime  and  left  coprime, 
respectively.  By  Theorem  2,  (4.3)  holds  for  some  kEJ, 
hence  by  the  analysis  above,  c  is  also  given  by  (4.1)  for 
some  wEH  such  that  wn,  +  vr £ /.  The  proof  is  thus  com¬ 
plete.  ■ 

The  theorem  yields  a  complete  parameterization  of  all 
possible  controllers  which  will  place  a  plant  in  H  given  the 
existence  of: 

1)  right  and  left  coprime  fractional  representations  of  p 
and 

2)  a  w  in  H  for  which  ( wn,  +  vr)  is  in  /. 

In  the  multivariable  case  where  p  is  a  square  matrix 
whose  elements  are  proper  rational  functions  it  is  well 
known  that  p  has  left  and  right  coprime  fractional  repre¬ 
sentations  [19].  In  order  to  obtain  a  proper  controller  one 
has  to  choose  w  in  (4.1)  so  that  det( w(s)n,(s)  +  cr( j)]  ^0  at 
infinity.  Methods  for  obtaining  such  a  proper  stabilizing 
controller  have  been  reported  in  [32]  and  [33],  Alterna¬ 
tively,  one  can  verify  the  existence  of  such  a  w  in  our 
algebraic  setting  by  invoking  the  fact  that  n,  and  dr  are 
right  coprime  and  applying  linear  algebraic  arguments 
thereto.  Of  course,  these  arguments  apply  to  distributed 
systems  as  well  as  lumped  systems  using  the  formulation 
of  [7]  and  [8], 

In  the  most  general  ring  theoretic  setting  neither  right 
nor  left  coprime  fractional  representations  of  p,  nor  a  w 
such  that  (wn  +  vr)  is  in  /,  are  assured  to  exist.  At  present, 
the  only  known  counterexample  to  the  latter  is,  however, 
in  the  ring  of  integers  which  is  of  no  system  theoretic 
interest. 

Conditions  I)  and  2)  have  been  conjectured  to  be  both 
necessary  and  sufficient  conditions  for  the  existence  of  a 
compensator,  c,  which  places  the  feedback  system  in  H 
[3].  In  fact,  if  c  places  the  feedback  system  in  H,  then 
from  (3.7)  we  obtain  left  and  right  fractional  representa¬ 
tions 

Pm(-  " '  “  (*.,«,)  " (4. 1 8) 


Note  that  there  is  no  guarantee  that  these  fractional 
representations  are  coprime.  These  representations  are, 
however,  coprime  when  the  compensator  is  in  H.  Indeed, 
in  that  case  they  satisfy  a  stronger  condition  which  com¬ 
pletely  characterizes  those  plants  which  can  be  placed  in 
H  by  a  compensator  in  H.  For  an  early  analogous  result, 
see  [10,  pp.  85-87]. 

Corollary  t:  For  the  feedback  system  2  of  Fig.  3  there 
exists  a  c  in  H  which  places  the  feedback  system  in  H  if 
and  only  if  p  admits  left  and  right  fractional  representa¬ 
tions  p  -  d,~  'n,  -  nrd~ 1  such  that  nr  is  a  right  factor  of 
1  -  d,  and  n,  is  a  left  factor  of  1  —  d,. 

Proof:  If  the  feedback  system  is  placed  in  H  by  a  c  in 
H  it  admits  the  fractional  representations  of  (4.18).  By 
calculation  [see  (3.7)] 

(4.i9) 

and 

(4.2°) 

which  verifies  their  coprimeness  since  c  is  in  H.  Moreover, 
upon  rearranging  the  terms  in  (4.19)  and  (4.20)  the  condi¬ 
tions  of  the  corollary  follow.  Conversely,  if  fractional 
representations  exist  which  satisfy  the  conditions  of  the 
corollary  there  exists  ur  in  H  such  that 

u,n,-d,=  1  (4.21) 

(equivalently  p  =  nrd~ 1  is  a  right  coprime  fractional  repre¬ 
sentation  with  vr  *=  1).  Now,  by  using  this  right  fractional 
representation  in  (4.1)  (with  any  left  coprime  fractional 
representation)  and  w  «  0  we  obtain  a  compensator  c  «  ur 
in  H,  which  places  the  feedback  system  in  H.  ■ 

V.  Examples 

Example  1:  A  Single  Variate  Servomechanism  Problem 4 

Here  G  is  the  ring  of  proper  rational  functions  and  H  is 
subring  of  functions  analytic  in  Rer  >  - 1.  Consider  the 
problem  of  designing  a  compensator  for  the  unstable 
plant  p(r)«»(r+  l)/(s2- 4)  which  will  simultaneously 
place  the  poles  of  the  feedback  system  in  the  region, 
Re(j)<  -  1,  and  cause  the  system  to  asymptotically  track 
a  step  input.  Since  our  transfer  functions  are  commutative 
we  may  adopt  common  right  and  left  fractional  repre¬ 
sentation  for  p(s).  In  particular. 


(•r+1) 

(j2-4) 


(*+l) 

[(*"2)1 

,(S  +  2)\ 

[(*  +  2)  j 

■/t(s)d(j)"' 

(5.1) 


ri6][  (J+I)]  [(j  +  2/3)1[(j-2) 

l  3  J[  (r  +  2)2  C*  +  2)  ][(r  +  2) 


-u(r)/i(j)  +  c(j)d(j)- 1.  (5.2) 


4The  purpoM  of  this  example  is  merely  to  give  a  simple  illustration  of 
the  theory,  tn  this  situation,  a  much  more  highly  developed  theory  is 
available  in  [29]. 
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Here,  each  of  the  four  rational  functions,  n(s),  d(s),  u(s).  Since  tl 
and  v (s),  lie  in  the  ring  of  operators  with  poles  in  the  with  an  e 
region  Re(s)  <  -  1  and  hence  the  set  of  all  compensators  constraint 
which  will  place  the  feedback  system  in  this  ring  is  given  zeros  at  : 
by  Theorem  3  with  w(s)  also  in  the  ring.  Moreover,  for  an  noise  sour 
arbitrary  *v(s)  the  input-output  mapping  for  the  resultant  that  the  a 
feedback  system  will  take  the  form  make  hy^i 

r  additional 

l  (s\m-  ill  /,\.  16(j+  1)  order  m»(j) 

W  )  (s  +  2)3  PS)  L  3(^  +  2)2  j 

■  —  n(s)d(s)w(s)  +  /i(s)u(s).  (5.3) 

By  the  final  value  theorem  the  feedback  system  will  Here,  by 
asymptotically  track  a  step  input  if  and  only  if  1^,(0)  *  1  we  assure 
(equivalently  c(s)  has  a  pole  at  zero).  As  such,  to  simulta-  parameter 
neously  place  the  poles  of  the  feedback  system  in  the  to  achieve 
region,  Re(j)<  - 1,  and  cause  the  feedback  system  to  ~1.  Subsi 
asymptotically  track  a  step  input  we  must  find  a  w(j)  with 
poles  in  this  region  such  that  /t^/0)*  1.  Evaluating  (5.3)  ^  (5): 


Since  the  previous  compensator  design  was  achieved 
with  an  especially  simple  w(s)  let  us  add  an  additional 
constraint  to  the  problem  by  requiring  that  A^/s)  have 
zeros  at  ±j  (so  that  the  system  will  be  insensitive  to  a 
noise  source  at  that  frequency).  Now,  from  (5.4)  it  follows 
that  the  above  design  is  the  only  compensator  which  will 
make  /^,  (0)*  1  with  a  constant  w(r);  hence  to  satisfy  this 
additional  design  constraint  we  will  work  with  the  first 
order  w(s)  in  the  form 

•*>-!£■  (5-7) 

Here,  by  specifying  the  zeroth-order  coefficients  of  w(s) 
we  assure  that  w( 0)*  -4/3  while  we  are  left  with  the 
parameters  a  and  b  to  create  the  required  zeros.  Of  course, 
to  achieve  our  stability  condition  we  must  have  -3 /b< 
—  1.  Substituting  the  w(s)  of  (5.7)  into  (5.3)  yields 


at  5*0  and  setting  it  equal  to  one  yields 

<5-4> 

implying  that  w(0)*-4/3.  As  such,  the  simplest  w(s) 
which  will  achieve  our  simultaneous  goals  is  the  constant 
h>(j)~  -4/3  whose  poles  are  trivially  in  the  prescribed 
region.  Adopting  this  w(s),  a  little  algebra  with  the  expres¬ 
sions  of  Theorem  3  will  reveal  that  the  required  com¬ 
pensator  takes  the  form 

while  the  input-output  mapping  for  the  feedback  system 
takes  the  form 

(5t.l)(205,_24) 

3(s  +  2)3 


(s+l)[(16b-3a)s2  +  (60+6a  +  32b)s  +  72] 
3(s  +  2)3(bs  +  3) 


To  obtain  the  desired  zeros  at  5  —  ±j  the  equation 
[(16b-3a)s2  +  (60+6a  +  32b)s  +  72]  =  A[s2+l] 


must  be  satisfied.  Now,  this  represents  three  linear  equa¬ 
tions  in  three  unknowns  and  has  the  unique  solution 

a- -  17,  b=^,  and  fc-72.  (5.10) 

16 

Moreover,  -3/b*  -  16/7  <  -  1;  hence  this  choice  of 
w(s)  will  also  assure  the  prescribed  degree  of  stabilization. 
As  such,  we  take 


-(175+4)  —  (2725  +  64) 

(215/16  +  3)  ”  (215  +  48) 


(5.11) 


Clearly,  c(s)  has  the  required  pole  at  zero  (for  A^/0)* 1), 
although  it  is  by  no  means  obvious  that  this  quasi-stable 
compensator  will  transfer  the  unstable  poles  of  p(s )  to  the 
prescribed  region.  Indeed,  this  illustrates  the  underlying 
power  of  the  proposed  design  technique  in  that  when  one 
designs  the  system  in  terms  of  w(s)  rather  than  c(5)  the 
pole  placement  or  stabilization  process  is  automatically 
resolved  by  working  with  a  M>)  whose  poles  lie  in  the 
prescribed  region  while  the  remainder  of  the  design  pro¬ 
cess  is  simplified  by  the  affine  relationship  between  w(s) 
and  the  matrices  hn  and  h^.  Finally,  we  note  that  c(5)  has 
a  zero  at  s  m  -  2  which  may  cancel  with  the  pole  of  p(s)  at 
5  *  -  2.  This,  however,  does  not  contradict  the  coprime- 
ness  assumptions  of  Theorem  3  since  the  common  factors 
involved  lie  in  J  which  serves  as  the  group  of  units  in  our 
theory.  Fortunately,  such  common  factors  can  never  lead 
to  an  erroneous  design  since  by  assumption  the  poles  and 
zeros  of  the  rational  functions  in  J  lie  in  the  prescribed 
region.  As  such,  any  cancellations  which  may  take  place 
are  benign. 


which  yields 

128(s  +  2)(s2+  1) 
(7s2-  56s -60)5 
and 

(s  +  2)  (21s  +  48) 

satisfying  all  of  our  design  criteria. 

Example  2:  A  Multivariate  Lumped- Distributed 
Decoupling  Problem 


(5.12) 

(5.13) 


Consider  the  multivariate,  lumped-distributed  plant 


(s-\) 

(5+1) 

_ 1 

(5-1) 


(5.14) 
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which  we  desire  to  stabilize  and  simultaneously  decouple 
by  feedback.  For  most  lumped-distributed  systems  one 
can  take  H  to  be  a  ring  of  matrices  whose  elements  lie  in 
the  algebra  k_  (a0)  of  stable  transfer  functions  generated 
by  lumped  elements  and  delays  while  G  is  a  ring  of 
matrices  whose  elements  lie  in  ^(oq),  the  algebra  of 
quotients  of  elements  in  <£_(0o),  as  per  Table  I.  In  our 
case,  however,  although  e~x/l  is  Z^-stable  (since  it  is 
analytic  on  the  right  half-plane  and  bounded  on  the 
imaginary  axis  [10])  it  has  a  “nasty”  singularity  at  j—0 
and  hence  does  not  lie  in  &(o0)  for  any  o0  <  0.  As  such,  we 
take  H  to  be  a  ring  of  2x2  matrices  whose  elements  are 
transfer  functions  lying  in  the  Hardy  space  //*(/?)  of 
functions  which  are  (essentially)  bounded  on  the  ju  axis 
and  admit  an  analytic  extension  into  the  right  half-plane 
(thereby  making  them  Z^- stable)  [12].  Similarly,  we  let  G 
be  a  ring  of  2x2  matrices  whose  entries  are  transfer 
functions  lying  in  the  Lebesgue  space  LX(R)  [12].  With 
this  setup  /  becomes  the  set  of  Hx  functions  which  are 
uniformly  bounded  below  on  the  ju  axis  while  J  is  the  set 
of  Hx  functions  whose  analytic  extension  is  uniformly 
bounded  below  in  the  right  half-plane  [12].  Equivalently,  J 
is  the  set  of  invertible  outer  functions  in  Hj^R)  [12]. 

Using  these  spaces  a  little  algebra  will  reveal  that  p(s) 
has  the  right  and  left  coprime  fractional  representations  in 
{G,H,J,J}  shown  below: 


/>(*) 


P(s) 


'  e-\/> 

(s-1  f  ' 

1 

o 

(J+ 1) 

(s+1)2 

0  1 

n 

(j-i) 

(r+1) 

(r+l) 

- nr(s)d,(s )"  (5.15) 


1  0 

-  1 

‘  e-W 

(j-1) 

(*+ 1) 

(*  +  D 

o  (J_1) 

0 

1 

(5+1). 

(j+1) 

md,(s)  ln,(s) 


(5.16) 


where 


0  0 

e~l/t  (j-l)J 

1  0 

I  0 

(*+l)  (j+1)j 

+ 

0  2 

0  1 

0  1 

0  (,_1) 

(j+1) 

(j+1) 

-  u,(s)n,(s)  +  vr(s)dr(s)  - 1  (5.17) 

and 


Upon  substitution  of  these  matrices  into  the  expression 
for  from  Theorem  3  one  obtains 


W4)"- 


'  e-./* 

(*-D2  " 

'*'n(J)  wn(s) 

(s+  1) 

(s+  D2 

0  (J+1) 

H'2i(J)  wn(s) 

1  0 

fo  1 

+ 

(r+1)2 

o 

0  7— — — r 

(s+  1) 

(j+1) 

(5.19) 


which  will  be  stable  if  and  only  if  the  w^s)  are  stable. 
Now,  to  decouple  the  system  we  require  that 


<«+!)’ 


+  (5.20) 

(■>+!)  (j+l)2 


and 

*<J)'(7TT)-!'(I)‘°'  <5'2I) 


Dearly,  w2,(j)»0  solves  (5.21).  On  the  other  hand  (5.20) 
has  numerous  solutions  none  of  which  are,  however,  sta¬ 
ble.  As  such,  the  system  cannot  be  decoupled  and  stabi¬ 
lized  simultaneously.  Note,  since  our  theory  guarantees 
that  all  stable  feedback  systems  with  plant  p(s)  take  the 
form  of  (5.19)  if  we  cannot  find  stable  w’s  which  decouple 
(5.19)  we  are  assured  that  it  is  impossible  to  simultaneously 
stabilize  and  decouple  p(s)  by  feedback  (using  a  com¬ 
pensator  as  specified  in  Theorem  3)  and  we  need  not 
consider  other  formulations. 

Since  we  cannot  simultaneously  stablize  and  decouple 
p(s)  by  feedback  the  best  we  can  do  is  to  try  to  stabilize 
p(s)  while  preserving  its  triangularity  (which  will  allow  us 
to  sequentially  adjust  its  various  outputs).  Formally,  this 
can  be  achieved  by  taking  w(s)—0  which  yields  the  in¬ 
put-output  mapping 


2(s-l)2 
(s+  l)2 
2 

(s+  1) 


(5.22) 


Unfortunately,  the  first  input  has  been  rendered  useless 
by  this  compensator  and  hence  the  goal  of  being  able  to 
sequentially  tune  the  outputs  is  not  achieved.  On  the  other 


e~'/‘ 

(r+1) 

0 


H) 

(r+l) 

1 _ 

(j+1) 


0  0 

1  0 

+ 

0  2 

0 

(j+1) 

-2(J-Q 

(*+l) 

1 


•  n,(s)u,(s)  +  d,(s)v,(s)~  1. 


(5.18) 
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hand,  if  we  take 

?]• 

then 

e~l/l  2(j  —  l)2 
O+D  <44-l>!  (5.24) 

0  o+l) 

which  has  the  desired  property  is  obtained.  In  particular, 

one  can  tune  the  second  input  to  control  the  second 
output  and  then  adjust  the  first  input  to  simultaneously 
cancel  out  the  effects  of  the  second  input  on  the  first 
output  and  control  the  first  output.  Of  course,  since  w(s) 
is  stable  so  is  ^(s). 

Finally,  we  note  that  as  we  have  formulated  our  theory 
one  can  deal  only  with  square  matrices  (since  rectangular 
matrices  are  not  closed  under  multiplication).  The  exten¬ 
sion  to  rectangular  matrices  is,  however,  straightforward 
[19]  and  yields  an  identical  theory  the  details  of  which  are 
left  to  the  reader. 

Example  3:  A  Multidimensional  Image  Restoration  Problem 


As  such,  the  set  of  all  possible  FIR  transfer  functions 
which  can  be  obtained  from  p(z ,,22)  by  feedback  takes 
the  form 

--[*?  +  2^2  +  2,21  +  32,+  3  z2  ] 

5[*\+*i2i]  (5-28) 

where  w(z,,zf)  is  an  arbitrary  polynomial  in  two  variables. 
Gearly,  w(z,,zj  should  be  low  order  to  keep  the  “point- 
spread  function”  of  hy^fz^zf)  as  concentrated  as  possi¬ 
ble.  Indeed,  if  we  take  w(z„22)»0  we  obtain 

(5-29) 

in  which  the  response  from  a  given  pixel  effects  only  two 
adjacent  pixels.  Note  that  the  fact  that  these  pixels  are  not 
centered  around  the  input  point  does  not  cause  any  diffi¬ 
culty  since  one  can  always  shift  the  origin  of  the  raster  to 
compensate.  Taking  this  wfz^zj)  we  obtain  the  simple 
compensator  c(z„zf)  —  -  z,  which  represents  a  one  direc¬ 
tional  shift  and  a  180°  phase  shift. 

An  alternative  design  which  also  yields  a  “point-spread 
function”  which  affects  only  two  pixels,  although  it  is 
shifted  further  from  the  origin,  is  obtained  with  wiz^zj** 
—  (l/9)z,.  This  yields 


P(z  1^2) 1 


*1  +  *2 
z2  +  z,z2+3 


(5.25) 


W*»'*i)"i[*»  +  2z»z*]  (5-30> 


denote  the  discrete  two-dimensional  transfer  function  for 
a  device  in  a  digital  image  processing  system.  Since  this 
represents  an  IIR  (infinite  impulse  response)  transfer 
function  the  image  processing  device  will  tend  to  “smear” 
the  image  with  the  data  observed  at  any  one  pixel  distort¬ 
ing  all  other  pixels  at  the  output  of  the  device.  In  an  effort 
to  reduce  this  “smearing”  effect  we  would  like  to  place  the 
device  in  a  feedback  system  whose  input-output  transfer 
function  minimizes  the  “smearing”  effect  In  particular, 
that  means  that  the  input-output  mapping  for  the  feed¬ 
back  system  should  have  an  FIR  (finite  impulse  response) 
transfer  function  with  its  “point-spread  function”  con¬ 
centrated  about  a  single  point  as  closely  as  possible. 

Since  the  FIR  transfer  functions  are  just  the  polynomi¬ 
als  we  let  H  be  the  ring  of  polynomials  in  two  variables 
and  G  be  the  ring  of  rational  functions  in  two  variables 
[16].  Once  again  employing  only  a  single  fractional  repre¬ 
sentation  since  these  rings  are  commutative  we  obtain  the 
coprime  fractional  representation 

"fl(*i»*2)<*(*„*j)~l  (5.26) 

where 

[“  J*t][*i+*a]  +  [  }][*i  +  *.*a+3] 

-  u(z„Z2)n(z„Zj)-»-c(Z|Zj)d(z„Zj)- 1. 

(5.27) 


c(*|,*2)“ 


22\{Z,  +  *2) 

z?  +  z,z2  +  3 


(5.31) 


Since  two-thirds  of  the  output  energy  in  this  design  is 
concentrated  at  a  single  point  whereas  the  energy  is  equ¬ 
ally  divided  in  the  previous  design  it  may  be  argued  that 
this  represents  a  superior  design.  On  the  other  hand,  the 
shift  from  the  origin  is  greater  and  the  compensator  more 
complex  in  this  case.  Finally,  since  all  FIR  transfer  func¬ 
tions  are  stable  (in  an  appropriate  sense)  the  feedback 
systems  obtained  via  either  choice  of  wfzpz^  are  stable. 
Moreover,  both  compensators  are,  themselves,  stable  as  is 
p{z„zj  [6], 


Example  4:  A  Time-Varying  Differential-Delay 
Stochastic  Optimal  Control  Problem 

Consider  the  feedback  system  of  Fig.  4  where  the  plant 
represents  a  cascade  of  a  time-varying  function  /  with  an 
ideal  predictor  e\  The  system  is  driven  by  a  stochastic 
process  a,  which  is  derived  from  white  noise  by  passing  it 
through  a  miniphase  filter  with  transfer  function  (s  +  2) /(s 
+ 1).  We  desire  to  choose  a  compensator  which  will 
stabilize  the  system  and  minimize  the  performance 
measure 

y-£l|*H2+ £11^11*  (5.32) 

under  the  constraint  of  stability.  Here,  d  is  the  stochastic 
process  observed  at  the  output  of  the  system,  b  is  the 
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Fig.  4.  Stochastic  control  system. 


Fig.  5.  Open-loop  optimization  problem. 


It  is  significant  to  note  that  even  though  we  are  inter¬ 
ested  in  designing  an  optimal  closed-loop  system  by  mini¬ 
mizing  over  the  operator  w  rather  than  the  compensator 
we  have  transformed  the  problem  into  the  open-loop 
optimization  problem  of  Fig.  5. 

Here  we  desire  to  minimize  J^E\\e\\2  over  all  causal 
operators  w,  where  g,,  g2,  and  g3  are  arbitrarily  specified 
bounded  operators.  In  our  case  we  take 

*. -[*"][/"]  (5-38) 

-1  _ 

and 


stochastic  process  observed  at  the  plant  input,  and  £  is 
the  expected  value  operator. 

Since  we  have  a  time-varying  component,  a  rational 
component,  and  a  delay  component  we  formulate  our 
theory  in  an  abstract  operator  theoretic  setting  [20]  with  G 
taken  to  be  the  bounded  operators  on  the  Hilbert  space 
L2(R)  and  H  taken  to  be  the  causal  bounded  operators 
(which  correspond  to  the  stable  systems  in  such  a  setting) 
[20],  [23],  Note,  in  this  setting  we  will  denote  the  time-in¬ 
variant  operators  by  their  transfer  function  and  the  time- 
varying  multiplication  operators  by  their  characteristic 
function.  Of  course,  one  must  be  careful  with  such  nota¬ 
tion  since  the  operational  calculus  associated  with  the 
time-invariant  components  is  only  partially  valid  in  such  a 
setting. 

Since  the  inverse  of  a  predictor  is  the  ideal  delay  which 
is  causal  one  immediately  obtains  the  right  and  left 
coprime  fractional  representations  for  p  in  the  form 

(5.33) 

where 

+  +  (5-34) 

and 


in  which  case  the  output  of  the  open-loop  system  is 
e  —  (d,  b)  in  the  product  space  constructed  from  two  copies 
of  the  (Hilbert)  space  on  which  the  given  system  is  de¬ 
fined.  Now,  if  we  take  the  a  in  our  open-loop  problem  to 
coincide  with  the  given  a  in  the  closed-loop  optimization 
problem  then  the  Pythogorean  law  (in  Hilbert  space) 
implies  that 

y-£||e||2-£||d||2+e||6||2.  (5.41) 

As  such,  our  two  optimization  problems  coincide. 

Interestingly,  an  explicit  solution  has  recently  been 
given  for  the  above  open-loop  optimization  problem  [9]. 
Indeed,  the  optimal  causal  w  is  given  by 

w0  -  k  - 1  [  A*  -  'gUiQagl  ~ '  ]  c*  " 1 '  (5.42) 

where  A  and  9  are  causal,  causally  invertible  operators 
such  that 

A'A-gJgj  **-g,&* f-  (5-43) 


[,][,]  +  [*"y’,][°]-'Wr  +  ^-»-  (5-35) 

Here,  we  have  assumed  that  /~l  exists  and  is  bounded 
(i.e.,  /  is  bounded  away  from  zero)  while  /  and  f~'  are 
both  causal  since  multiplication  by  a  function  of  time  is  a 
memoryless  operation  [20].  From  Theorem  3  it  now 
follows  that  the  input-output  and  input-plant  input  map¬ 
pings  for  our  feedback  system  with  compensator  defined 
by  a  causal  operator  w  will  take  the  form 

<5J6> 

and 

*v--[.-]»[,-][r,]+[«-.][r,].(5j7) 

As  such,  our  optimization  problem  reduces  to  choosing 
the  causal  w  which  minimizes  the  performance  measure  of 
(5.32)  where  d-  a  and  b  m  htit  a. 


Q,  is  the  covariance  for  the  stochastic  processes  a,[  ]c 
denotes  the  causal  part  of  an  operator,  and  denotes 
the  adjoint  operator.  To  apply  this  general  theory  to  our 
example  we  represent  the  adjoint  operation  when  applied 
to  a  transfer  function  by  g(r)*— g(-r)  which  coincides 
with  the  classical  adjoint  on  the  ju  axis.  Of  course,  the 
memoryless  multiplication  operators,  [/]  and  [/“*],  are 
self  adjoint  Finally,  since  a  is  the  stochastic  process 
generated  from  white  noise  by  passing  it  through  the  filter 
(j  +  2)/(r  +  l) 


o  .  (J+2)  If  (j+2)  j*_  (j  +  2)(j  — 2) 

[(j+1)J[(j+I)J  (j+1)(j-1)  ’ 


(5.44) 


First  we  calculate  A  and  9  via 

A*A-[  ~f\  -e']  ..7.!..  -/*+ 1  (5.45) 


410 


(EBB  TRANSACTIONS  ON  AUTOMATIC  CONTROL,  VOL.  AC-25,  NO.  3,  JUNE  1980 


(5.46) 


■[r'W!^rfg±a]M 


(5-49) 


whose  causal  part  must  now  be  computed.  Recalling  that 
the  memoryless  term  factors  through  the  causal  part 
bracket  [9]  it  suffices  to  compute  the  causal  part  of  the 
time-invariant  system  with  transfer  function 


g(s) -  Millie*  . 

,W  (j  +  1) 


(5.50) 


Taking  the  inverse  Laplace  transform  we  obtain  the  im¬ 
pulse  response  of  this  system  in  the  form 


*(/)-«(/+ l)  +  e(,+1)t/(f+l) 


(5.51) 


Here  /_,(/)»■,/(/- 1)  and  we  have  used  the  properties  of 
the  delay  and  predictor  to  obtain  the  equalities 
"LCi'Ie-*]  and  [/“‘le'J—le'I/ril.  Of  course,  the  ex¬ 
ponential  transfer  functions  commute  with  the  rational 
transfer  functions  allowing  the  cancellation  of  the  ex¬ 
ponential  terms  in  (5.46).  From  (5.45)  and  (5.46)  one  may 
now  readily  obtain  the  required  causal,  causally  invertible 
A  and  9  operators  in  the  form 

A-A*-V/2+1  andA-'-A*-1- - l- -  (5.47) 

yff+ 1 


(5.48) 

The  next  step  in  evaluating  (5.42)  is  to  compute  the 
term  in  the  bracket,  i.e., 

.[  (J  + 2)(r-2)  1  r  /.||r  Of  ,  (J~l) 

(s+l)(s-l)P  JLe  J  (,-2) 
— L-fW-»i[j£±afe=ai 

V^TL  1  (r-H)(r-  1) 


where  S  is  the  Dirac  delta  function  and  U  is  the  unit  step 
function.  Now,  the  causal  part  of  g(t)  is  obtained  by 
setting  g(t)  to  zero  for  t  less  than  zero;  hence 

[s(t)]c-g(t)U(t)-e^"V(t)--e‘U(t)  (5.52) 


or  equivalently 


[*(*)] 


c  e(r  + 1) 


(5.53) 


Multiplying  through  by  the  memoryless  factor  from  (5.49) 
we  then  obtain 


(5.34) 


and  finally 


"'o-*",[A*',8?S2Q„8r0*"V 


y/fTT 


[r']yfF+i 


1[  1  |[(s-H)]r  1 

'  e[  (r+1)  J[  (s  +  2) 


(5.55) 


which  is  surprisingly  simple  given  the  complexity  of  the 
derivation. 

Substituting  the  expression  of  (5.55)  into  the  formula  of 
Theorem  3  now  yields  an  expression  for  our  optimal 
compensator  and  the  input-output  mapping  for  the  re¬ 
sultant  feedback  system  in  the  form 


c-«[/r,,](s  +  2)-l 


e(s  +  2) 


(5.56) 


(5.57) 


Note  that  hy^i  is  stable,  as  required,  even  though  both  p 
and  c  are  unstable. 


VI.  Nonunear  Feedback  Systems 

From  an  algebraic  point  of  view  the  fundamental  dif¬ 
ference  between  linear  and  nonlinear  systems  is  the  fact 
that  nonlinear  systems  fail  to  satisfy  the  right-distributive 
property,  x(y  +  z)  ■  xy  +  xz.  They  do,  however,  satisfy  all 
of  the  other  axioms  for  a  ring  with  identity  including  the 
left-distributive  property  ( y  +  z)x—yx  +  zx .  As  such,  one 
can  attempt  to  extend  the  preceding  development  to  non- 
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linear  systems  by  carrying  it  out  in  left-distributive  rings, 
G  and  H  [23].  Indeed,  if  we  define  a  right  coprime 
fractional  representation  for  a  system  g  in  a  left-distribu¬ 
tive  ring  G  relative  to  {G,H,I,J}  precisely  as  we  did  in 
Section  II  the  fundamental  properties  1,  2,  and  3  go 
through  without  modification. 

Property  IN:  Let  g  -  n,d~ 1  be  a  right  coprime  frac¬ 
tional  representation  of  g  in  { G,H.I,J }  where  G  and  H 
are  left-distributive  rings  with  identity.  Let  n,  and  d,  have 
a  common  right  factor  rEH,  i.e.,  nr  =  xrr,  d,=y,r  for 
some  xrEH  and yr £ H.  Then  r  has  a  left  inverse  in  H. 

Property  2N:  Let  g**nrd~x  be  a  right  coprime  frac¬ 
tional  representation  of  g  in  {G,H,I,J}  where  G  and  H 
are  left-distributive  rings  with  identity.  Let  g  —  xry~'  be  a 
second  (not  necessarily  coprime)  right  fractional  repre¬ 
sentation  of  g  in  {(7,  //,  /,/};  then  there  exists  r  in  H  such 
that 

xr-n,r  and  y,  -  drr.  (6.1) 

Property  3N:  Let  g  *■  nrd~ 1  with  nrEH  and  drSl 
where  G  and  H  are  left-distributive  rings  with  identity. 

a)  If  drEJ,  then  g&H. 

b)  «  a  right  coprime  fractional  representa¬ 
tion  of  g  in  {G,H,I,J},  then  g&H  implies  drEJ. 

With  the  aid  of  property  3N  one  can  do  a  complete 
analysis  of  a  nonlinear  feedback  system  +/>)“'“ 

nrd~ 1  where  nrd~ 1  is  a  right  coprime  fractional  repre¬ 
sentation  of  \u.  Indeed,  hyu  is  well  defined  in  G  if  and 
only  if  4S/  and  it  is  well  defined  in  H  if  and  only  if 
drEJ.  Note,  however,  that  we  cannot  construct  our  frac¬ 
tional  representation  for  hyu  from  a  fractional  representa¬ 
tion  for  p  since  the  verification  that  such  a  representation 
is  coprime  appears  to  require  right-distributivity  [see  (3.4) 
and  (3.5)]. 

The  right  coprime  fractional  representation  plays  a  spe¬ 
cial  role  in  the  nonlinear  case  because  Ayu”p(l  +p)_l 
holds,  whereas  hyu^(l+p)~'p  does  not  (even  though  the 
latter  formula  is  true  for  the  linear  case).  As  such,  those 
results  on  the  analysis  of  feedback  systems  which  assume 
a  left  coprime  fractional  representation  theory  fail  as  does 
the  design  theorem  since  it  simultaneously  employs  both 
left  and  right  coprime  fractional  representations.  We  be¬ 
lieve,  however,  that  these  results  should  hold,  at  least  in 
part,  for  nonlinear  systems  with  an  appropriate  modifica¬ 
tion  of  the  theory.  In  particular,  since  the  rings  G  and  H 
are  asymmetric  we  believe  that  asymmetric  concepts  of 
left  and  right  coprimeness  will  be  required  to  achieve  this 
end. 

VII.  Conclusions 

Although  several  of  our  examples  are  characterized  by  a 
deep  analytic  structure  the  key  to  our  fractional  repre¬ 
sentation  approach  to  feedback  system  design  is  the  alge¬ 
braic  nature  of  the  main  results.  Indeed,  the  entirety  of 
our  modeling,  analysis,  and  synthesis  theory  was  derived 
with  no  more  sophisticated  mathematics  than  addition, 
multiplication,  subtraction,  and  inversion.  As  such,  it  ap¬ 


plies  to  essentially  any  class  of  linear  systems  and  by 
proper  choice  of  the  rings  G  and  H  the  results  are  applica¬ 
ble  to  a  variety  of  systems  problems. 

Although  we  believe  that  the  present  work  represents 
the  first  attempt  at  the  formulation  of  an  axiomatic  frac¬ 
tional  representation  theory  for  systems  which  may  be 
matched  to  the  feedback  system  analysis  and  synthesis 
problems  of  interest  the  work  owes  much  to  a  number  of 
recent  results  on  the  input-output  theory  of  linear  sys¬ 
tems.  The  use  of  a  fractional  representation  theory  for 
multivariate  systems,  though  implicit  in  a  number  of 
classical  results,  was  popularized  by  Rosenbrock’s  poly¬ 
nomial  matrix  fractions  [19].  Interestingly,  however, 
Rosenbrock’s  goal  was  apparently  to  permit  the  powerful 
analytic  and  arithmetic  theory  available  for  polynomial 
matrices  to  be  applied  to  rational  matrices  whereas  the 
present  fractional  representation  theory  is  motivated  by 
the  desire  to  formulate  a  representation  theory  for  systems 
which  is  closed  under  inversion.  Over  the  years  numerous 
generalizations  of  the  polynomial  matrix  fraction  concept 
have  been  formulated  for  distributed  systems  [4],  [5],  [13], 
[21],  and  multidimensional  systems  [9],  [24]  while  partial 
extensions  to  the  time-varying  and  nonlinear  cases  have 
appeared  in  a  number  of  unpublished  reports  [1 1],  [22]. 

For  any  type  of  fractional  representation  theory  to  be 
meaningful  it  must  be  identified  with  an  appropriate 
coprimeness  concept.  Indeed,  the  key  to  the  present  for¬ 
mulation  is  the  use  of  the  algebraic  coprimeness  concept 
of  (2.4)  in  lieu  of  the  more  classical  common  factor 
criterion.  Such  a  criterion  has  previously  been  applied  by 
one  of  the  authors  in  a  study  of  fractional  representations 
for  distributed  system  [4]  and  was  also  shown  to  be  the 
strongest  of  several  possible  coprimeness  criteria  for  mul¬ 
tidimensional  systems  by  Youla  and  Gnavi  [26].  Of 
course,  it  is  well  known  as  one  of  the  several  equivalent 
criteria  for  coprimeness  in  the  polynomial  matrix  fraction 
theory  [16],  [19], 

The  feedback  system  analysis  theorems  of  Section  III 
are  motivated  by  the  now  classical  theorems  for  determin¬ 
ing  the  stability  of  a  multivariate  feedback  system  in  terms 
of  its  polynomial  matrix  fraction  representation  [10]. 
Moreover,  the  system  synthesis  theorem  is  an  outgrowth 
of  the  feedback  system  stabilization  theorem  of  Youla  et 
al.  [24],  [25].  Indeed,  the  present  work  began  with  an 
attempt  to  give  a  simple  proof  of  this  most  powerful 
analytic  theorem  and  developed  through  several  stages  of 
generalization  and  simplification  into  its  present  form. 
Finally,  the  optimization  theory  used  in  Example  4  repre¬ 
sents  the  generalization  [9]  to  an  operator  theoretic  setting 
of  a  result  originally  developed  by  Youla  et  al.  in  the 
frequency  domain  for  use  in  conjunction  with  their  stabili¬ 
zation  theorem  [24],  [25]. 
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ABSTRACT 

The  problem  of  system  diagnosis  is  presented. 
Its  applitatinns  and  recent  results  are  discussed. 

r.  INTRODUCTION 

A  Large-Scale  Dynamical  System  (LSDS)  can  be 
described  as  one  which  is  either  impossible  or  im¬ 
practical  to  be  represented  by  a  single  composite 
system  equation,  and  Is  to  be  solved  as  such. 
Therefore,  it  Is  better  for  a  large-scale  system  to 
be  represented  as  an  Interconnection  of  components 
or  subsystems.  One  such  representation  is  present¬ 
ed  in  Section  II. 

Fault  diagnosis  is  a  typical  problem  of  large- 
scale  dynamical  systems.  It  is  a  problem  of  study¬ 
ing  the  (large)  deviations  of  the  overall  system 
caused  by  the  deviations  of  the  subsystems  (some¬ 
time^  simply  circuit  elements).  It  is  not  practical 
to  formulate  this  problem  from  a  single  composite 
system  equation. 

The  fault  diagnosis  of  analog  systems  at  present 
is  an  art  rather  than  a  science.  For  example,  it  is 
not  clear  how  the  concept  of  "fault"  can  be  defined. 
On  the  other  hand,  the  system  diagnosis  problem, 
while  related  to  the  fault  diagnosis  problem,  can 
be  well  defined.  This  is  done  in  Section  III. 

It  turns  rut  that  the  application  of  system 
diagnosis  problem  is  not  restricted  to  fault  diag¬ 
nosis  problems.  It  also  has  applications,  for 
example,  to  the  modeling  of  socio-economic  problems. 
A  discussion  ol  the  applications  of  system  diagnosis 
problems  Is  given  in  Section  IV.  Finally,  a  dis¬ 
cussion  of  the  recent  results  is  presented  in 
Section  V. 


where  x  “  col(x^)  and  a  and  b  are  similarly 
defined,  and  A  -  diag  (A^)  and  B,  C,  and  D  are 
similarly  defined.  For  definiteness,  let  xt  Rn  , 
ae  Rm,  and  be  Rr,  and  A,  B,  C,  and  D  be  con¬ 
stant  real  matrices  of  compatible  dimensions. 

The  connection  of  the  input  and  output  termi¬ 
nals  of  these  subsystems  with  the  input  terminals 
u  and  the  output  terminals  y  of  the  entire 
system  Is  assumed  to  have  the  form 


Here,  u  e  Rp  and  y  e  R1*  and  L  and  R  are  con¬ 
stant  real  matrices  with  compatible  dimensions. 

The  model  of  LSDS  defined  by  (2)  is  a  linear 
and  time- Invariant  one  and  it  can  be  represented 
in  the  forms  of  block  diagrams  in  Fig.  1.  Two 
special  classes  of  LSDS  are  of  special  interest. 
Consider  a  linear  integrating  circuit.  Each  of  its 
n-port  subnetworks  can  be  represented  by  (1)  and 
the  ICVL  and  XCL  equations  have  the  form  of  (2b). 

As  such,  ai  and  bi  are  the  port  voltages  and  the 
port  currents.  Therefore,  any  linear  integrating 
circuit  can  be  represented  by  an  LSDS  model. 
Consider  a  block-diagram  connection  of  linear 
time- invariant  subsystems  (1) .  Since  every  block- 
diagram  connection  can  be  represented  by  (2b) , 
the  above  system  can  also  be  represented  by  an 
LSDS  model.  Therefore,  the  LSDS  model  (2)  includes 
both  network-type  and  system-type  formulations. 

In  this  paper,  we  only  consider  linear  time- 
invariant  LSDS's  which  can  be  represented  by  (2). 

When  the  composite  component  equation  (2a)  is 
represented  by  a  transfer  function 


II.  A  MODEL  OF  LINEAR  TIME- 
INVARIANT  LSDS 


Let  un  LSDS  be  comprised  by  a  finite  number  of 
subsystems,  each  of  which  Is  a  linear  time- invariant 
dynamics L  system  of  the  type 


Vi  +  Vi 


bi  ■  Vi  +  Vi 


i  •  l,2,...,k,  where  a^,  ,  x,  are  the  Input 

vector,  the  output  vector,  and  the  state  vector  of 
each  of  the  components.  They  are  not  necessarily 
of  the  same  dimension.  The  matrices  A^,  B.,  Cp 
and  Da  are  compatible  to  thesa  vectors.  These 
components  con  be  put  into  a  single  composite  com¬ 
ponent  equation. 


b  -Z(s)a 


(3) 


where 

2(8)  -  C(sl  -  A)_1B  +  D  (4) 

It  becomes  the  component  connection  model  in  the 
frequency  domain,  considered  by  Saeks  et  al  11-4], 
Combining  (3)  with  (2b),  we  have 


y  -S(Z)u  (5) 

where 

S(Z)  -  L22  s-  l2J(1  -ZL^r^L^  (6) 

which  is  used  for  the  purpose  of  system  diagnosis. 

Trick  et  al  [5]  considered  the  special  case 
when  (2b)  represents  KV1.  and  KCL  equations.  When 
fundamental  cut-set  equations  and  fundamental 
loop  equations  are  used,  Fq.  (2b)  becomes, 


x  "  Ax  +  Ba 
b  »  Cx  +  Da 


(2a) 


(7) 


where  v  .  i  ,  ire  i  lie  bi anch  vol  capes  nd  branch 
C  C  C  i 

cunvnet  in  i  In  cotm*  and  vc  and  It  ar.  chose  In 
che  i  ree  [61.  iiimpar  i  ng  (  )  wich  '2b),  w  have 


and  consequently, 

r  T  T,  *ct  ,  .  ,  .  4c.C 

lvct  4t  J  "  flct  vt]  L 

vc  vc 


0  (9) 


from  lnpuc/oucpuc  measurements.  Only  Markov  param- 
ecers  can  be  uniquely  decermined. 

Assercion  1  (11).  Two  minimal  (concrol lable 
and  observable)  linear  cime-invarianc  scace  equacion 
represencacion  (F,G,H,J)  and  (F,5,H,J)  with  state 
spare  of  che  same  dimension  n  are  realization  of 
the  same  transfer  function  macrix  H(s)  if  and 

onlv  if 


The  last  equality  follows  from  (9)  that  L  is 
anti-symmetric.  £q.  (9)  is  a  ■generalization  of 
Tellegen  Theorem  since  it  is  derived  from  a  more 
weaker  condition  (9)  instead  of  (7). 

Navid  and  Willson,  Jr.,  [7]  considered  che 
case  of  DC-netvork,  i.c.,  the  connection  equation 
again  has  the  form  (7)  and  furthermore,  Che  com¬ 
posite  component  equation  (2a)  or  (3)  has  the  form 

b  -  Da  (10) 

wich  the  condition,  among  others,  thac  each  row  of 
D  1ms  one  and  only  one  non-zero  element. 

We  have  presented  the  mathematical  model  (2a) 
and  (2b)  and  its  relation  to  other  models.  Next, 
we  want  to  define  the  system  diagnosis  problem. 


H  FiG  -  H  FiG, 


0,1, .. . ,2n-l. 


The  Markov  parameters  m. 


mQ  A  J 

“i  -  H  Fi-1G 


i-1 . 2n 


are  well  defined  functions  of  p  .  Therefore,  we 
can  write 


^(P)  ,  i  -  0,1 . 2n 


III.  SYSTEM  DIAGNOSIS  PROBLEMS 

With  the  model  (2a)  and  (2b),  the  system 
diagnosis  problem  can  be  stated: 

The  System  Diagnosis  Problem:  Given  L  and 
n  (dimension  of  x) ,  the  problem  is  to  determine  the 
component  parameters  (A,B,C,D)  from  inpuc/aucpuc 
measurements. 

Let  us  now  formalize  the  problem.  Let  us 
assume  that  there  exists  a  state  equation  for  the 
component  connection  model  (2a)  and  (2b)*  of  the 
form:  # 

z  “  Fz  +  Gu 

y  »  Hz  +  Ju  (11) 

where  zeRn,  ii£n.  The  transfer  function  is  given 
by 

H(s)  -  H(sl  -  F)-1G  +  J  (12) 

Let  che  vector  p  denote  the  component  parameters 
(A, B.C.O)  to  be  decermined.  Its  relation  to  the 
transfer  function  is  given  by 

H(s)  -  h(s,  p)  (13) 

where  h  is  determined  by  che  system  structure 
(L,  n). 

Since  by  mechods  of  system  identification 
techniques  [9,10],  the  transfer  function  H(s) 
can  be  obtained  from  proper  inpuc/output  measure¬ 
mencs,  the  problem  of  system  diagnosis  is  reduced 
to  the  problem  of  the  determination  of  p  from 
H(s). 

Definition  1.  The  parameters  p  of  ar<  LSDS 
are  said  to  be  dlagnosable  is  p  can  be  determined 
from  H(s),  i.e. ,  h  is  injective. 

The  same  problem  can  be  formulated  in  the 
time-domain.  It  turns  out  chat  Che  scace  equacion 
(F,G,H, J)  of  (11)  cannot  be  uniquely  determined 


where  m  •  col. (m.)  and  F  •  col.fF^. 

Assertion  2.  The  parameters  p  of  the  LSDS 
are  diagnosable  if  and  only  if  F  is  injective. 

Therefore,  the  basic  problem  of  svstem  diagno¬ 
sis  is  to  study  the  invertability  of  h  (the  fre¬ 
quency  domain)  or  that  of  F  (the  time  domain) . 

Let  us  consider  an  example. 

Example  1.  Consider  the  two-stage  RC  ladder 
as  shown  in  Fig.  2.  The  composite  component 
equacion  (2a)  is  given  by 


0  i  D  0 


vc  -  Col. (v- 


and  similarly  for  ic,  vG,  lc,  and 

G  -  diag(Cx ,  G2) 

D  »  ding (D^ ,  Dj) 

where  *  1/C^,  i  ■  1,2. 

T'"e  connection  equacion  (2b)  is  given  by 


rvr~|  r  o  i-i  o  •  q  rv 

. i  i  -i ;  o  j 

<  1  -1  1  o  « 0  „ 


0_1  1 _ !_C 

1  0,0  0  Jo 

I  •  - 


*  The  n.a.s.  condition  for  the  existence  of  state 
equation  for  (2a)  and  (2b)  is  given  by  Singh  and 
Liu  [8]. 


Equations  (20)  and  (21)  have  a  state  equation 
representation 


can  be  easily  carried  ouC.  Therefore,  if  a  I.SUS 
is  diagnosable,  it  is  also  fault  diagnostic . 
However,  the  diagnosability  is  sometimes  too  strong 
for  the  problem  of  fault  diagnosis,  because  it  is 
not  necessarily  to  determine  the  values  of  p  in 
order  to  test  whether  or  not  p  e  Cl  .  Furthermore, 
the  cost  of  the  evaluation  of  the  values  of  p  is 
high,  and  accuracy  is  so  poor  that  the  calculated 
values  of  p  become  useless  [see  Section  V], 
Therefore,  the  art  of  fault  diagnosis  is  to  find 
ways  of  testing  whether  or  not  p  e  (1  with  little 
calculation  of  the  values  of  p  .  Nevertheless, 
the  diagnosability  does  provide  the  Insight  for 
searching  such  an  art  and  the  design  of  test  points. 

A  special  issue  on  Automatic  Analog  Fault 
Diagnosis  [12],  edited  by  S.  D.  Bedroslan,  contains 
much  information  and  aany  references  on  this  topic. 


2.  Network  Synthesis 

Given  a  prescribed  transfer  function  H°(s)  , 
a  network  synthesis  problem  is  to  find  a  component 
connection  model,  (2a)  and  (2b),  so  that  its  trans¬ 
fer  function  H(s)  equals  the  prescribed  one 
H°(s).  If  the  connection  I.  is  predetermined, 
then  the  rest  of  the  synthesis  problem  is  to  de¬ 
termine  the  parameter  values  p  which  yield  the 
prescribed  transfer  function  H°(s).  This  is 
precisely  the  problem  of  system  diagnosis.  Note 
that  the  state-of-the-art  of  network  synthesis  is 
mainly  on  single-input  single-output  systems,  while 
the  system  diagnosis  problems  are  not  restricted 
to  SISO  problems. 


m  -  F(p)  (25) 

where  p  •  (G^ .G^.D^ .D^)  for  both  (24)  and  (25). 
Note  that  is  not  needed  since  we  only  have 

four  parameters  to  be  determined. 

The  parameter  p  is  diagnosable  if  and  only 
if  the  function  h  or  F  is  invertible.  In  ocher 
words,  we  want  to  solve  for  p  from  either  (24) 
when  the  transfer  function  H(s)  is  measured,  or 
from  (25)  when  the  Markov  parameter  m  is  measured. 

Note  that  Che  system  diagnosis  is  an  extension 
of  system  identification.  The  purpose  of  system 
identification  is  to  determine  either  the  coeffi¬ 
cients  of  a  transfer  function  or  the  Markov  para¬ 
meters  from  input/output  measurements.  The  purpose 
of  system  diagnosis  is  to  determine  the  component 
parameters.  The  latter  are  usually  the  ones  to  be 
designed. 

Please  note  that  the  functions  h  and  F 
are  nonlinear.  Therefore,  the  system  diagnosis 
problem  is  a  nonlinear  problem  even  if  the  original 
system  is  linear. 

Finally,  che  above  problem  can  be  modified  to 
be  more  flexible.  For  example,  we  may  know  some  of 
the  component  parameters  and  want  to  determine  che 
rest  of  them.  The  system  diagnosis  problem  can  al¬ 
so  be  extended  to  nonlinear  systems. 

IV.  APPLICATIONS 


1.  Fault  Diagnosis 

Let  fl  be  the  set  of  component  parameters 
p  for  tdiich  the  system  is  fault-free,  i.e., 

1)  if  p  e  £1^  chen  the  LSDS  is  fault-free,  and 

2)  if  pcG-,  then  the  LSDS  is  at  fault. 

When  the  LSDS  is  at  fault,  che  fault  diagnosis  prob¬ 


lem  is  to  determine  which  components  of  p  causing 
ncl^  .  Oearly,  if  p  can  be  determined  and  fl 
is  clearly  defined,  the  test  whether  or  not  p c 


3.  Modeling  of  Nonlinear  Circuit  Models 

The  circuit  shown  in  Fig.  3  can  be  used  to 
model  the  electrical  behaviors  of  the  nerve 
membrane* .  Chua  found  that  the  Potassium  conduc¬ 
tance  gk  and  the  sodium  conductance  gNg  should 
be  modeled  as  a  nonlinear  time-invariant  memristive 
1-ports  [13]  instead  of  the  linear  time-variant 
elements  in  the  original  model.  They  have  the 
state  equation  form: 

n  •  f (n,  vK) 
iK  -  GK(n)vR 


for  gg,  and 


■  ’  fl("’  vNa> 
h  -  f2(h,  vNa) 

iNa  "  GNa(”’  h)vNa 


for  gNa  .  The  state  variables  are  n  and  (m,  h) 
respectively.  The  explicit  expressions  of  che  non¬ 
linear  functions  are  given  in  [13].  One  of  the 
problems  is  to  identify  the  parameter  values  in  the 
circuit  model  given  in  Fig.  3.  This  problem  can  be 
considered  as  che  system  diagnosis  problem  of  a 
nonlinear  system. 


Modeling  of  Soclo-Econoaical  Systems 
Two  forms  of  models  are  often  used  by  econo¬ 


mists,  the  structural  form  and  the  reduced  form 
[14].  This  can  best  be  illustrated  by  an  example. 


Consider  che  economic  model: 


*Nodgkin  and  Huxley  were  awarded  the  Nobel  Prize  in 
1963  for  their  development  of  this  circuit  model. 


“o  + 


I(t)  -  \il  Y(t-l)  +  B2  K(t) 
Y(t)  S  C(t)  +  1(C)  +  0(c) 


problem  of  electrical  networks,  wich  known  nominal 
component  parameters.  By  means  of  Tellegen's 
Theorem  and  adjoint  networks,  che  following  equation 
can  be  obtained  for  a  linear  network: 


C  *  Consumption 
I  »  Investment 
Y  *  National  Income 
G  •  Government  Expenditure  on  Goods 
and  Services 
T  ■  Taxes  on  Income 
R  «  Government  Regulator 

The  last  equation  is  an  identity.  The  above  set 
of  equations  can  be  solved  to  yield, 


where  M  and  &Q  are  measurable,  and  dp  is  che 
difference  between  Che  nominal  values  and  Che 
actual  values  of  p  .  Therefore,  p  is  diagnosable 
if  and  only  if  M  is  invertible.  Note  that  Eq. 
(26)  is  in  general  nonlinear  while  Equation  (27)  is 
linear.  The  latter  is  achieved  at  the  expense  of 
more  test  points. 

Navid  and  Willson,  Jr.,  [7]  considered  the 
diagnosis  problem  for  resistive  networks.  In  this 
case  Eq.  (6)  takes  the  form 

g(R)  -  1*22  +  L21(I  -  R  Lu 


o0  “j8,  M2  al  _  “  11  12 

m  i-a  +  i_a  Y(t-l)  +  i_o  R(t)+2  [G(t)-T(t)_^  where  R  is  the  resistive  matrix.  By  taking  ad- 

i  1  vanrio«  nf  r  Visa  f  r  .  .  T  .  »  T  ••  T _ 


I(t)  -  BjYCt-l)  +  B2R(t)  (R) 

°0  ®i  1  °1 

1-a ^  l-a^  1_®2  2”°1 

The  above  equations  give  the  explicit  input  (?,R,G) 
and  output  (C,I,Y)  relation.  The  first  set  of 
equations  is  called  che  structural  form  and  che 
second  set  of  equations,  the  reduced  form.  Clearly, 
che  reduced  form  is  like  the  transfer  function 
representation.  The  structural  form  is  like  the 
component-connection  model  with  the  third  equation 
(the  identity  equation)  being  the  connection 
equation.  Consequently,  che  system  identif iciation 
techniques  can  be  used  to  identify  the  coefficients 
of  the  reduced  form,  while  the  system  diagnosis 
technique  can  be  used  to  identify  che  coefficients 
of  the  structural  form. 

V.  DISCUSSION  OF  RECENT  RESULTS 

The  system  diagnosis  problem  can  be  classified 
into  three  aspects:  (A)  the  problem  of  dlagnos- 
ablllty,  (B)  che  computational  problem,  and  (C) 
che  design  of  test  points. 

(A)  Dlagnoaabllltv 

Three  representative  results  on  the  condition 
of  dlagnnsabllity  will  be  presented. 

Sen  and  Saeks  [3]  have  considered  LSDS  which 
can  be  represented  by  (6) 

g(s,p)  A  S(Z(  <,p))  -  L_~  +  L-.  [I-Z(s,p)L. ,  f  4(s,p)L,,  1 


vantage  of  the  fact  that  L21,  L22,  L2i  and  L22 
arrived  from  a  network  graph,  they  were  able  to 
find  an  elegant  condition  for  the  invertability  of 


g  .  This  condition  depends  on  only  the  graph,  not 
the  component  values. 

(B)  Computational  Problems 

As  it  was  discussed  in  Section  III,  the  system 
diagnosis  problem  is  in  general,  nonlinear,  even 
when  the  system  is  linear.  When  the  dimension  of 
p  is  large,  the  computational  cost  will  be  high 
and  the  results  will  be  inaccurate.  The  following 
example  will  illustrate  the  point. 

Example  2  [15].  Consider  a  4-stage  RC-ladder 
as  shown  in  Fig.  4.  The  transfer  function  of  the 
ladder  is  given  by 

y(#)  .  V4  +  v3  *  V2  +  b3s  L  Vfi 

aQs4  +  alS3  +  a2s2  +  a3s+l  Dl(s) 

The  component  values  can  be  calculated  from  the 
coefficients  ai's  and  b^'s  by  a  continue- 
fraction  expansion  [16]  as 
0,(8) 

Ri  -  s1^  N^> 

C,  -  Urn  Ni<*> 

w> 


By  measuring 


at  multiple  frequencies 


the  matrix  f<*r 


che  above  equation  can  be  expressed 


-  G(p) 


Ni-l(s)  "  Ci-1  Dl(,) 

Di_j(s)  -  R1_l  Nl.1(s> 


Now,  we  first  arbitrarily  chose  che  parameter  val¬ 
ues  to  be 


R2  •  47 


which  is  a  function  of  p  .  Clearly,  p  Is  .li- 
agnosable  If  and  only  if  G  Is  Invertible. 

Trick  et  a]  [Sj  considered  the  diagnosis 


The  cx.h*  t  v.il'i.-  of  t‘>e  eoeff  Lcion  ts  can  be  calcu¬ 
lated: 


aQ  »  0.001  1 1 '.06 1 88 
Uj  •  0.249'.  1.51 6 
a2  •  7.  3770808 
a j  -  25. 01  .8 


b()  -  5. 977554E-4 
bj  «  0.09526598 
b2  *  0.492064 
b3  -  0.517 


(29) 


Wc  now  calculate  the  component  values  by  Che  con- 
tinue-fraetioonl  expansion  method.  If  the  exact 
values  of  coefficients  (29)  are  used  and  If  there 
is  no  compute rlonal  error,  the  component  values 
given  in  (28)  should  he  completely  recovered.  Two 
cases  have  been  examined.  When  the  exact  values 
of  (29)  are  used,  the  calculated  component  values 
are  completely  agreeable  with  chat  of  (28).  This 
indicates  the  error  dne  to  computation  is  small. 
Now,  when  significant  value  of  (29)  is  reduced  to 
seven,  the  calculated  component  value  becomes, 


R2  -  47.005 

R3  -  -0.018072 

R.  -  9.1305 
4 


C3  -  0.015 

C2  -  0.70919 

C3  -  -0.23768 

C.  -  0.030482 
4 


which  have  significant  deviation  from  (28) .  In 
fact,  R3  and  C3  are  negative!  In  reality,  one 
can  hardly  measure  the  transfer  function  coeffici¬ 
ent  to  7-dlglts  accuracy.  This  example  shows  that 
even  p  is  diapnosable  but  it  may  not  be  computable 
in  realistic  situations. 

Let  us  now  re-examine  the  RC-ladder  and  di¬ 
agnose  it  from  another  way.  Observe  the  time- 
domain  equations  for  2-stage  RC-ladder  (25),  l.e.. 


m 

m 


m 


0 

1 

2 

3 


-<".V  *  ciVi2> 

3  3  2  2  2  2 

VY(G1  +  V  +  G1  C2  D1  D2 


the  test  points.  This  process  at  present  is 
more  of  an  art  than  a  science.  There  are  two 
results.  Sen  and  Saeks  [3]  provide  a  test  point 
selection  algorithm.  Visvanathan  and  Liu  [17] 
provide  a  design  procedure  so  that  the  diagnosis 
equation  (25)  is  sequentially  linear. 

ACKNOWLEDGEMENT 

This  paper  was  supported  in  part  by  the 
Office  of  Navel  Research  under  Grant  N00014-78-C- 
0444. 


VI .  REFERENCES 

[1]  R.  Saeks,  S.P.  Singh  and  R.W.  Liu,  "Fault 
Isolation  Via  Components  Simulation",  IEEE 
Trans.  Circuit  Theory,  Vol.  CT-19,  pp.  634- 
640,  Nov.  1972. 

[2]  M.N.  Ransom  and  R.  Saeks,  "Fault  Isolation 
with  Insufficient  Measurements" ,  IEEE  Trans ■ 
Circuit  Theory.  Vol.  CT-20,  pp.  416-417,  1973. 

[3]  N.  Sen  and  R.  Saeks,  "Fault  Diagnosis  for 
Linear  Systems  via  Multifrequency  Measurements", 
IEEE  Trans.  Circuits  and  Systems.  Vol.  CAS-26 
pp.  457-465,  1979. 

[4]  H.S.M  Chen  and  R.  Saeks,  "A  Search  Algorithm 
for  the  Solution  of  Multifrequency  Fault 
Diagnosis  Equations",  IEEE  Trans.  Circuits 
and  Systems,  Vol.  CAS-26,  pp.  589-594,  1979. 

[5]  T.N.  Trick,  W.  Mayeda  and  A. A.  Sakla,  "Calcu¬ 
lation  of  Parameter  Values  from  Node  Voltage 
Measurements",  IEEE  Trans.  Circuits  and 
Systems.  Vol.  CAS-26,  pp.  466-474,  1979. 

[6J  S.  Seshu  and  M.B.  Reed,  Linear  Craohs  and 
Electrical  Networks.  Addison  Wesley,  1961. 

[7]  N.  Navid  and  A.N.  Willson,  Jr.,  "  A  Theory 
and  an  Algorithm  for  Analog  Circuit  Fault 
Diagnosis",  IEEE  Trans.  Circuits  and  Systems. 
Vol.  CAS-26,  pp.  440-457,  1979. 


Note  that  G^  can  be  solved  from  the  first  equation. 
After  G^  is  obtained,  D^  can  be  solved  from  the 
second  equation.  After  (G^,  D^)  are  obtained, 

Qy  can  be  solved  from  the  third  equation.  Finally, 
after  (G^,  Dj,  Gj)  are  obtained,  D2  can  be 
solved  from  the  fourth  equation.  Observe  further, 
that  each  time,  the  equation  to  be  solved  is  a 
linear  one.  Therefore,  the  above  set  of  nonlinear 
equations  is  called  sequentially  linear.  It  can 
be  shown  that  for  any  n-stage  RC-ladder,  Eq.  (25) 
is  sequentially  linear.  As  a  consequence,  both 
the  computational  cost  is  reduced  and  accuracy  is 
Improved  tremendously.  Liu  and  Visvanathan  117] 
have  provided  a  sufficient  condition  for  a  system 
whose  diagnosis  equation  is  sequentially  linear. 
However,  this  condition  may  be  too  strong  because 
it  does  not  include  the  n-stage  RC-ladder,  although 
the  latter  motivated  the  study. 

(C)  Design  of  Test  Points 

If  component  parameter  p  of  a  LSDS  is  not 
diagnosable,  how  can  <<ne  make  it  diagnosable? 

The  obvious  answer  is  that  we  need  more  measurements, 
or  more  test  points.  How  marfy  and  what  location? 
These  are  the  questions  faced  by  those  who  design 


[8]  S.P.  Singh  and  R.  Liu,  "Existence  of  State 
Equation  Representation  of  Linear  Large-Scale 
Dynamical  Systems”,  IEEE  Trans.  Circuit  Theory. 
Vol.  CT-20,  pp.  239-246,  May  1973. 

[9]  D.  Graupe,  Identification  of  Systems.  Krieger 
Publishing  Co.,  1976. 

[10]  P.  Eykhoff,  System  Identification.  John  Wiley 
&  Sons,  1977. 

[11]  B.L.  Ho  and  R.E.  Kalman,  "Effective  Construc¬ 
tion  of  Linear  State  Variable  Models  from 
Input-Output  Data”,  Proc.  3rd  Ann.  Allerton 
Conf.  Circuit  and  System  Theory,  pp.  449-459, 
1965. 

[12]  S.D.  Bedroslan,  (editor).  Special  Issue  on 
Automatic  Analog  Fault  Diagnosis.  IEEE  Trans. 
on  Circuits  and  Systems.  July  1979. 

[13]  L.0.  Chua,  "Device  Modeling  Via  Basic  Non¬ 
linear  Circuit  Elements",  Special  Issue  on 
Nonlinear  Circuits  end  Systems.  Nov.  1980, 

(To  appear). 


A  Comparison  of  Frequency-Domain  Approach  and  Time-Domain  Approach 


C .  Lin 


A  COMPARISON  OF  FREQUENCY-DOMAIN  APPROACH 
AND  TIME-DOMAIN  APPROACH:  A  Case  Study  of 
Fault  Analysis  of  Analog  Circuits 

by 

Chen-Shang  Lin 
Technical  Report  No.  EE -802 

May  23,  1980 


This  report  was  supported  in  part  by  the  Office  of  Navel  Research  under 
Grant  N0014-78-C-0444. 


I.  Introduction 


There  have  been  considerable  efforts  expended  in  analog  fault  analysis. 

Most  of  them  [1],  [2],  [3]  employ  frequency-domain  approach,  i.e.,  diagnosing 
faulty  components  from  measured  transfer  function,  while  few  [5]  use  time- 
domain  approach  to  isolate  faults  by  means  of  Markov  parameters.  Theoretically, 
both  approaches  are  still  under  development  and  all  seem  feasible.  It  is  the 
purpose  of  this  report  to  compare  these  two  approaches  numerically  by  sim¬ 
ulation  on  RC  ladders.  In  this  example,  it  is  shown  that  the  time-domain 
approach  is  far  better  than  the  frequency-domain  approach. 


-  numerator  polynomial  of  Z 


denominator  polynomial  of  Z 
Nw(s)  -  Cw  •  D±(s) 
Di_1(s)  -  Ri_1  •  Ni_1(s) 


On  the  other  hand,  the  ladder  has  a  state  equation  expression 


k  m  Ax  +  Bu 
y  *  Cx  +  Du 


where  x  are  the  capacitor  voltages,  u  the  terminal  current,  y 
terminal  voltage  and 


which  can  be  measured  by  a  method  developed  by  Liu  and  Suen  [5], 


Once  the  Markov  parameters,  nu,  i  *  0,1, . . . ,2n-l,  are  obtained,  the 
circuit  parameters  R' s  and  C's  can  be  solved  from  the  simultaneous 
equations (2)  and  (3). 

In  this  simulation,  it  assumes  no  measurement  error  for  both  Z(s)  and 
m^'s  .  They  are  exact.  We  want  to  find  the  numerical  error  generated  by 
solving  (1)  and  (3) . 

Ladders  of  four  and  six  stages  were  chosen,  transfer  functions  and  A,  B, 

C,  D  parameters  were  calculated  using  nominal  values  of  components.  Then,  as 
a  way  of  comparison,  the  significant  digits  of  coefficients  of  transfer  'function  and 
entries  of  A,  B,  C,  D  were  reduced  before  we  performed  the  manipulation  by 
these  two  methods.  The  results  are  listed  in  Tables  1  and  2. 

It  is  clear  from  the  tables  that,  as  significant  digits  decrease,  the 
estimated  values  of  frequency-domain  method  stray  away  from  nominal  values 
gradually,  then  collapse  abruptly  at  a  certain  point  and  become  unrealizable, 
i.e. ,  some  of  the  values  become  negative.  On  the  contrary,  the  results  of  the 
time-domain  method  remain  about  the  same  order  of  accuracy  as  parameters  of 
state  equation. 

The  discrepancies  may  be  due  to  the  following  reasons: 

1)  The  frequency-domain  approach  deals  with  computations  of  complex 
numbers  while  the  time-domain  approach  deals  with  computations  of  real  numbers  - 

2)  The  given  circuit  is  sequentially- linear  for  the  time-domain  approach 
[5,6].  This  can  be  demonstrated  by  the  4-stage  RC  ladder.  Solving  (2)  and 

(3) ,  we  have 

*o  ■  R1 


“X  0, 


References 


[1]  R.  Saeks,  S.P.  Singh  and  R.W.  Liu,  "Fault  Isolation  via  Components 
Simulation",  IEEE  Transaction  on  Circuit  Theory.  Vol.  CT-19,  No.  6, 
pp.  634-640,  November,  1972. 


[2]  N.  Sen  and  R.  Saeks,  "Fault  Diagnosis  for  Linear  Systems  Via 

Multifrequency  Measurements,"  IEEE  Trans,  on  Circuits  and  Systems, 
Vol.  CAS-26,  pp.  457-465,  1979. 


[3]  T.N.  Trick,  W.  Mayeda  and  A. A.  Sakla,  "Calculation  of  Parameter 

Value  from  Node  Voltage  Measurements,"  IEEE  Trans,  on  Circuits  and 
Systems,  Vcl.  CAS-26,  pp.  466-474. 


[4]  R.  Liu  and  L.C.  Suen,  "Minimal  Dimension  Realization  and  Identif iability 
of  Input/Output  Sequence,"  IEEE  Trans,  on  Automatic  Control,  April  1977. 


[5]  R.  Liu  and  V.  Visvanathan,  "Sequentially  Linear  Fault  Diagnosis:  Part 
I-Theory,"  IEEE  Trans,  on  Circuits  and  Systems,  Vol.  CAS-26,  pd.  490- 
496,  1979. 


[6]  V.  Visvanathan,  "Sequentially  Linear  Fault  Diagnosis:  Part  II-The  Design 
of  Diagnosable  Systems,"  IEEE  Trans,  on  Circuits  and  Systems,  Vol.  CAS-26, 
pp.  558-564,  1979. 


IV.  Conclusion 


This  simulation  strongly  suggests  that  time-domain  approach  is  more 
data-tolerant  than  frequency-domain  approach  in  the  sense  that  no  sudden 
breakdown  occurs  and  component  values  can  be  estimated  with  reasonable 
accuracy  when  the  measurement  is  not  accurate  enough  or  where  the  noise  must 
be  taken  into  consideration.  Thus,  though  it  is  still  too  early  to  have 
definite  conclusion,  time-domain  approach  seems  to  be  a  more  promising  method 
in  attacking  fault  diagnosis  problem. 


Time-Domain  Method  Frequency-Domain  Method 
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Significant  Digits 


10 


Time-Domain  Method 

R1 

R2 

R3 

R4 

C1 

C2 

C3 

C4  1 

2.2 

47 

8.2 

1 

.015 

.47 

.01 

.022 

2.2 

47 

8.2 

1 

.015 

.47 

.01 

.022 

2.2 

47 

8.2 

1 

.015 

.47 

.01 

.022 

2.2 

47 

8.2 

1 

.015 

.47 

.01 

.022 

2.2 

47 

8.2 

1 

.015 

.47 

.01 

.022 

2.2 

47.017 

8.2044 

1.0003 

! 

.014999 

.46982 

.0099962 

.021997 

2.2 

47.857 

8.4454 

1.0569 

i  .014925 

I - 

.46434 

.0096776 

.020555 

Frequency-Domain  Method 


"1  2  3  "4 


2.2  47  8.2  1 

.015  .47  .01  .022 


2.2  47  8.2  1 

.015  .47  .01  .022 


2.2  47  1.3587  7.7603 

.015  .46597  .0056867  .030347 


2.2  47.005  -.018072  9.1305 

.015  .70919  -.23768  .030482 


Table  1:  In  the  frequency-domain  approach,  the  calculated  values  of  and  become 

negative  when  the  significant  digits  are  reduced  to  7.  No  such  discrepancies 
in  the  time-domain  approach. 
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Abstract 

The  problem  of  designing  an  optimal  state  estimator  for  a 
linear,  discrete-time  system  with  a  singular  noise  covariance 
matrix  is  considered.  In  this  article,  this  problem  is  cast  as 
a  constrained  optimization  problem  and  the  approach  appears  to 
be  more  direct.  Solution  to  this  optimization  problem  gives  a 
reduced-order  optimal  state  estimator. 

I.  INTRODUCTIONS 


In  a  linear  stochastic  system,  the 
output  measurement  may  be  only  partially 
noise  corrupted.  Although,  in  practice, 
one  may  argue  that  there  exist  no  noise- 
free  measurements,  it  is  quite  possible 
that  some  of  the  measurements  are  noise 
corrupted  while  the  others  are  relatively 
accurate.  Under  the  Gaussian  assumption, 
this  implies  that  the  noise  covariance 
matrix  has  both  large  and  small  eigen¬ 
value*,  which  easily  leads  to  numerical 
difficulties  in  the  implementation  of  the 
Kalman  filter.  It  is  convenient  in  this 
case  to  model  these  more  accurate  measure¬ 
ments  as  noise- free  entities. 

The  study  of  this  problem  dates 
back  to  the  work  of  Bryson  [1],  for  con¬ 
tinuous-time  systems,  and  that  of  Brammer 
[2],  for  discrete- time  systems. 

Kwakemaak  [3]  and  Anderson  [4]  discussed 
this  problem  as  a  singular  linear  state 
estimation  problem;  however,  no  explicit 
solutions  were  given.  Tse  and  Athens  {5] 
derived  a  rather  complicated  "observer- 
estimator"  which  is  essentially  an  exten¬ 
sion  of  the  Luenberger  observer  (6] . 

Later,  Yoshikaw*  [7]  gave  a  simpler  deri¬ 
vation  for  minimum-order  optimal  state 
estimators.  More  recently,  Fairman  [8] 
proposed  a  "hybrid  estimator"  which 
features  " coord inatizat ion"  and  achieved 
a  reduced-order  optimal  estimator. 

The  main  feature  of  the  approach 
used  in  this  peper  is  the  following; 

After  a  proper  similarity  transformation, 
the  state  variables  are  decomposed  into - 


two  parts,  one  to  be  estimated  by  a  reduced- 
order  filter  and  the  other  to  be  recovered 
exactly  from  the  noise- free  measurements. 
Then  the  dynamic  equation  of  the  latter 
part  of  the  state  equation  is  considered  a 
constraint  on  the  optimal  estimation  of  the 
other  part  of  the  states.  Hence  the  state 
estimation  problem  in  this  case  is  cast  as 
a  constrained  optimization  problem,  which 
leads  to  a  reduced-order  optimal  state 
estimator. 

12.  PROBLEM  FORMULATION 

A  linear,  discrete-time  stochastic 
system  can  be  described  by  the  following 
equations 

x(k+l)>A(k)  x(k)+B(k)  u(k) ,  k-0,1,2,...  (1) 

y(k)  -  C(k)  x(k)  +  v(k)  ,  k*l,2, ...  (2) 

where  x(*)c  Rn»  u(*)c  Rp,  and  y(*)c  R?“. 

To  further  specify  the  problem,  the  follow¬ 
ing  assumptions  are  made; 

U).  x(0) ,  u(0) ,  u(l),...,v(l),  v(2) , 
....  are  independent  random  vec¬ 
tors  with  the  following  statistics 

B[x(0)l-xft  Elx(0)xT(0)]-V 

0  *Q 

E[u(k)l-2Fk  ltu(k)uT(k-i)] 

-Vu(k)6(i)  Tk.i 

S[v(k)  ]■£  Vk  E[v(k)vT(k-i)J 


u 


PtoUtnttd  at  tSJid  Uidotit  SympaUum  on  CUiauti  and 
Syitau,  Aug utt  19i0f  to  bt  published  in  tht  Pnocuding* 
o(  tht  Conivunct 


Btu(1c)vT(i)]-OpXB  Tk.i 


Blv<k)xT(0) 


-Vv(k)6(i)  Tic,  i 
Stu(k)xT<0>] 

“  °px„  ** 


m  m 

where  u  (•)  and  v  (•)  danotas  the  trans- 
poae  of  vectors  u( • )  and  v(«),  respective- 
ly,  and  6( • )  deontea  the  Kronecker  delta, 
(ii)  For  any  k,  the  Vy(k)  is  a  non¬ 
negative  definite  matrix  with 


rank  m. 


where  m^<n. 


Under 


this  assumption,  implementation 
of  the  standard  Kalman  filter 
involves  inversion  of  a  matrix 
which  may  be  singular.  Tse 
and  Athans  [5]  proposed  an  ob¬ 
server-estimator  of  order  n-m2 

which  performs  as  well  as  higher 
order  estimators,  where  n^Am-m^. 

(iii)  For  any  k,the  C(k)  is  of  full 

rank,  i.e.  every  element  of  the 
output  measurement  is  indepen¬ 
dent  of  the  others. 

The  objective  here  is  to  design  an 
optimal  state  estimator  of  order  n-n^. 

Without  loss  of  generality,  one  can  assume 
that 


KCkJ-pmj 

Ira  Jj-a 


1 

where  v^(k) c  R  ,  and  thus  the  covariance 
matrix  of  v(k)  can  be  written  as 


Vk> 


'v  (k)  *  0  1 

l 

cf - - 

oLjXm^  J  mjXmj 


where  Vy  (k)  is  strictly  positive  definite. 

It  is  easy  to  see  that  there  exists 
a  non-singular  matrix  Q(k) ,  such  that  the 
trans  forma  t ion 

x(k)  -  Q(k)  x(k)  (3) 

yields  the  following  state  and  measurement 
equations 


where 


z,  (k+1) 


s,  (k) 


A(k)  -  Q(k+1)  A(k)  Q~1(k) 

S(k)  -  Q(k+1)  B(k) 

and 

C(k)  -  C(k)  Q"1(k) 

*^0*  R  l,x2(*)c  R^.y^*)*  R  1.y2(*)c  R*2, 
and  n^n-m^.  A(*)>  S(0  and  <?(•)  are  par¬ 

titioned  accordingly.  Moreover,  aince  C(k) 
is  of  full  rank,  ?22(k)  ia  invertible. 

Bence  there  exists  a  one-to-one  correspon¬ 
dence  between  the  state  z2(k)  and  output 

y2(k),  namely 

z2  (k)  -  C22(k)  y2(k)  (6) 

and  thua  in  the  state  equation,  (4) ,  only 
z^(*)  must  be  estimated.  The  dynamic 

equations  for  z^t*)  and  z2(*)  are 

z^k+1)  ■  Aj^tk)  z^(k)  +  A^tkJZjtkJ+lfj^tkJutk) 

(7) 

z2(K+1)  *  A21(k)z1(k)+3^2(k)z2(k)+ff2(k)u(k) 

(8) 

It  is  obvious  from  (7)  and  (8)  that  z^k) 
and  z2(k)  are  mutually  dependent;  there¬ 
fore  the  estimation  of  z^(k)  does  depend 
on  the  dynamic  behavior  of  z2(k).  Thua 
the  filtering  problem  becomes  that  of  find¬ 
ing  an  optimal  £^(k+l|k+l)  subject  to  (7) 

and  conatrained  by  (8),  where  £^(k+l|k+l) 

denotes  the  estimate  of  z^k+l)  given 

measurements  up  to  time  k+1.  Mote,  from 
(6)  and  (7),  that  the  state  s2(k)  can  be 

regarded  as  a  deterministic  input  in  the 
Kalman  filtering  problem. 

t.  THB  REDUCED- ORDER  OPTIMAL 
1MAT0R 
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a(k+l)»F(k)a(k)+G(k)  z2(k)+M(k)u(k)  (10) 

and 

y1(1c)>B(1c)a(1c)-*>N(k)z20c)-t>v10c)  (11) 

where 

F(k)  -  Xlt(k)  -  P(k+1)  jL.tk) 

11  ^  (10. a) 

G(k)  -  X12(k)  -  P(k+1)  Xj2(k)  +  F(k)  P(k) 

(10.b) 

M(k)  -  Sx(k)  -  P(k+1)  ff2(k)  (10. c) 
H(K)  -  ?n(k)  (11. a) 

and 

N(k)  -  Cu(k)  +  2u(k)  P(k)  (ll.b) 

Notice  that  P(k) ,  aa  defined  in  tha  above 
equations,  can  be  viewed  aa  the  Lagrange 
multiplier  in  the  atandard  conatrained 
optimization  problem.  Now,  the  problem 
of  eatimating  (k)  ia  replaced  by  that  of 

eatimating  a(k)  given  the  meaaurementa 
{yi(l)»yi(2)#....y1(k)}  and  atatea  (z2(l), 

z2(2) , . . . ,z2(k) }.  From  (9),  it  ia  obvioua 
that 

#(k|k)  -  ^(klk)  -  P(k)  z2(k)  (12) 

and 

V~(k|k)  -  V~  (k|k)  (13) 

a  *x 

where  fi(klk)  denotea  the  eatimate  of  a(k) 
conditioned  on  input-output  meaaurementa 
up  to  time  k,  and 

a(k|k)  A  ftOtlk)  -  a(k) 

z^klk)  A  ^(klk)  -  zx(k) 

V^(k|k)  A  B[a(k|k)  a^Jklk)  ] 

The  unbiaaed  linear  eatimator  of  a(k) 
ia  given  by  the  following  n^th- order 

filter 

#(k+l)  |k+l)-[l-K(k+l)H(k+l)]F(k)4{k|k) 

♦K(kel)  [y1(k+l)-N(k+l)  z2(k+l) 
-H(k+l)G(k)z2(k)]  (14) 

Hence  the  error  quantity  a(k|k)  propagatea 

aa 

a(k*l|k+l)-[X>X(k+l)H(k+l)]F(k)a(k|k) 

+  (I-K(k+1)  H(k+1)  ]M(k)  u(k) 
-X(k+l)v1(k+l)  (15) 

and  the  error  covariance  matrix  V~(.|*)  ia 
given  by 

V~Ow.l|lw>i)-(I-lC(k+i)H(k+l)  jr(k+l) 
_ _H-JC(k+l)H(k+l)  JT  _  . 


+K(k+l)Vv  (k+l)KT(k+l)  (16) 

where 

T  (k+ 1)  Ay  (k)  V~(  k  |  k)  FT  ( k)  +M  ( k)  Vu  ( k)  MT  ( k) 

8  u  (17) 

Obaerve  that  I*(k+1)  ia  eaaentially  the 
one-atap  prediction  error  covariance  matrix 

(9) .  An  optimal  eatimator  ia  taken  to  be 
an  eatimator  which  miniaizea  the  trace  of 
the  error  covariance  matrix.  Therefore, 
it  ia  left  to  minimize  Tr [V~(k+l|k+l) ] 

with  reapect  to  (K(k+1); P(k+1) ] .  Notice 
that  here  K(k+1)  plays  the  role  of  atand¬ 
ard  Kalman  gain  while  P(k+1)  ia  the 
Langrangian  of  the  optimization  problem. 
Minimizing  Tr  (V~(k+l]k+l)  ]  with  respect  to 

K(k+1)  yields 

K*(k+1)  »  r(k+l)  HT(k+l)  R“1(k+1)  (18) 

where  R( • )  is  the  positive-definite  symme¬ 
tric  matrix  given  by 

R(k+1)  -  H(k+»r{k+l)HT(k4-l)+V„  (k+1) 

V1 

(18. a) 

Obaerve  that  (15)- (18)  are  identical  to 
the  formulation  of  the  standard  Kalman 
filter  (9).  However,  in  this  case,  it  is 
further  required  to  optimize  the  state 
estimator  with  reapect  to  the  choice  of 
P(k+1)  ,*  i.e.  minimize  Tr  [V~(k+l|k+l)  ]  with 

respect  to  P(k+1) .  Let  the  optimal  P(k+1) 
which  minimizes  Tr [V~(k+ l|k+l) ]  be  denoted 

by  P*(k+1).  Then  it  can  be  shown  that 

P*(k+1)  c  Z*(k+1)  (19) 

where  Z*(k+1)  is  the  set  given  by 

n.xm. 

Z*(k+1)  -  {P:  P  A,(k)-A,(k),  P  s  R  1 

1  2  (19. a) 

Ax(k)  -  X21(k)v~(k|k)}51(k)+S2(k)vu(k)Bj(k) 

(19. b) 

A2(k)  -  A11(k)v~(k|k)351(k)+8ri(k)vu(k)Sj(k) 

(19. c) 

Notice  that,  when  A^(k)  ia  non-aingular, 
P*(k+1)  is  given  by 

P*(k+1)  -  A2(k)  A^lk)  (19. d) 

The  set  Z*(k+1)  will  be  discussed  in  the 
next  section. 

Once  P*(k+1)  is  found,  the  matrices 
F(k) ,  G(k) ,  and  M(k)  can  be  specified  and 
denoted  by  F*(k),  0*(k),  K*(k),  reapective- 
ly,  by  substituting  P*(k+1)  and  P*(k)  into 

(10) .  Similarly,  N*(k+1),  r*(k+l)  and 
K*(k+1)  can  be  obtained  from  (ll.b),  (17), 
and  (18. a) . 

All  in  all,  the  reduced-order  optimal 
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■tat*  estimator  ia  formulated  by  the  fol+ 
louring  equations 

a(k+llk+l)  -  lZ-K*(k+l)H(k+l)]F*(k)t(k|k) 

+  K*(k+1)  (y^k+1)  -  H*(k+1) 

*2(k+l)  -  H(k+l)G«<k)z2(k)1 

(20. a) 

^(k+Hk+1)  -  A(k+l|k+l)  +  P*(k+1)  z2(k+l.) 

(20. b) 

z,(k+l)  -  eti(k+l)  y3(k+l) 

2  22  2  (20. c) 

V~(k+l|k+l)  -  (I-K*(k+l)H(k+l)  U**(k+1) 
z  (20. d) 

1C*  (k+l)  -  r*(k+l)HT(k+l)  (H(k+l)r*(k+l) 

-HT(k+l)  +  V  (k+11]’1  (20. e) 

V1 

r*(k+l)  -  F*(k)V~(k|k)P*T(k)  +  M*(k)7u(k) 


M*T(k)  -  [I:-P*(k+1)  ]  [A  (k) 
V~(k|k)AT(k)  +  ff(k)Vu(k)ST(k)] 


where 


[I:-P*(k+l)]J 


(20. f) 


*  T  V*z<fclk>  Om.xm,  ' 

v~(k|k)  «  -0- - ••  75  — - 

L  m2*ml  j  m2Xm2 

due  to  the  fact  that  z^(k)  ia  exactly 

measurable  for  every  k.  Also,  the  error 

covariance  matrix  V~(k+l|k+l)  is  given  by 
z 

V~(k+1  |k+l)  -  [(r*  (k+l)  )"*■  +  HT(k+l) 

Z 

V  (k+l)  H(k+1)  J”1  (20. g) 


and  P*(k+1)  is  specified  by  Eqs.  (19) 


4.  COMMENTS  OH  THE  REDUCED- ORDER 


The  formulation  for  the  optimal  state 
estimator  derived  in  last  section,  (19)- 
(20),  is  identical  to  that  of  the  "hybrid 
estimator"  given  in  (8) ,  except  that  in 
(8)  a  deterministic  input  is  inserted  to 
the  system  dynamics.  However,  the 
approach  her*  is  more  straightforward  and 
it  is  clearer  here  that  the  choice  of 
P(k+1)  is  crucial  to  the  optimality  of  the 
estimator.  It  can  be  seen  that  the  gen¬ 
eral  coordinate  transformation  diacussed 
in  [8J  is  split  into  two  coordinate  trans¬ 
forms  tionst  on*  which  depends  on  the  sys¬ 
tem  output  matrix  H  only  and  on*  which 
depends  on  the  matrices  Pj^  and  P^2.  Zt 

can  be  seen  that  the  similarity  transfor¬ 


mation  Q(*)  of  (3)  is  equivalent  to  J'~^’(*) 
defined  in  (14)  of  (8),  and  thus  the  state 
variable  z  defined  in  (3)  can  be  regarded 
as  equivalent  to\C  defined  in  (14)  of  (8). 
Furthermore,  the 'variable  ■  defined  in  (9) 
can  be  similarly  related  to  the  variable 
5X  in  (20)  of (8). 

The  relations  in  (19)  which  govern 
the  choice  of  P*(k+1)  are  vital  to  the 
understanding  of  the  optimal  state  esti¬ 
mator,  and  thus  deserves  some  detailed 
discussion.  First,  notice  that 

h(A1(k))  ch(A2(k)) 


where  h(A^(k))  and  lt(A2(k))  denote  the  null 
■paces  of  Aj,(k)  and  A2  (k) ,  respectively. 

The  following  observation  is  thus  made: 
Observation  The  set  2  + (k+l)  given  in  (19. a) 
is  a  non-empty  set.  Moreover,  if  A^(k)  is 

singular,  any  member  P(k+1)  c  Z*(k+1)  yields 
the  same  estimator  performance. 

How,  according  to  the  value  of  A^tk), 


the  following  special  cases  are  of  interest: 


Case  1:  A^(k) 


In  this  case. 


A2(k)  ■  and  thus 


n.xm 

I* (k+l)  -  R  ■L 


2 


This  case  is  possible  if  (8)  does  not  con¬ 
tain  any  information  pertaining  to  the 
estimation  of  z^(k) ;  for  example,  if 


"  °m2xni  and  V10  “  °m2xp-  ex" 

treme  example  for  this  case  is  that  m2  - 

Of  i.e.  all  measurements  are  noise  corrupt¬ 
ed.  Zn  this  condition,  the  estimator  pre¬ 
sented  in  Section  3  is  identical  to  the 
standard  full+order  Kalman  filter  whose 
performance  is  independent  of  the  choice 
of  P(k+1) . 

Cf  2:  Ax(k)  a  0  i.e.  Ax(k)  is  a 

singular  non-zero  matrix.  Zn  this  case, 
only  seme  components  of  z2(k)  contain  in¬ 
formation  about  (u(k),  Zjik)).  Thus  the 

similarity  transformation  discussed  in 
Section  2  can  be  redefined  so  as  to  iso¬ 
late  only  those  elements  of  z2(k)  which 

constitute  a  constraint  on  (u(k) ,s^(k) ) . 

Hence  the  Lagrange  multiplier  P(k+1)  that 
should  be  considered  is  an  element  in 

j^rwrJ-xr,  r  <  Alternatively, 

any  member  in  t* (k+l)  can  be  used  in  the 
filter  realisation. 

Case  3 i  A^(k)  is  positive-definite,  this 
condition  can  be  fulfilled  when  Vu(k)  is 

.positive-definite  for  any  k  ■  0,1,2, . 

Zn  this  case,  z*(k+l)  contain*  on*  and 
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only  on*  clamant  P*(k+1),  which  la  given 
by  (19. d). 

When  P*(-)  ie  uniquely  specified 
(Case  3) ,  one  can  compare  the  error 
covariance  matrix  given  by  (20. g)  with 
that  obtained  for  arbitrary  P(*)  and  ob¬ 
serve  the  same  expression  for  V~(k+l|k+l) . 

The  difference  is  that  r*(k+l)  of  (20. f) 
has  the  following  property 

Tr  IT*  Oc+1)  ]  <  Tr  (TOt+1)  1 
where  !*(•)  is  obtained  from  non- optimal 
P(-). 

Finally,  the  implementation  of  this 


estimator 

should  be  initiated  as  follows: 

$(0|0)  -  Slx(0)] 

"  X0 

i.e. 

41(0|0)  -  E (Zj'(O)  ] 

therefore 

V~(0|0)  - 

IX  (0>  1  °m  xm 

1*1 _ 1  _1_2_ 

*  Q(0) V  QT(0) 
X0 

and 

P*<°>  *  Vm, 

X  4 

> 

5.  CONCLUSION 

A  reduced-order  optimal  state  esti¬ 
mator  for  a  linear,  discrete-time  system 
associated  with  a  singular  noise  covari¬ 
ance  matrix  has  been  derived  in  this  paper. 
The  main  idea  in  this  derivation  is  to 
cast  this  singular  state  estimation  problem 
as  a  constrained  optimization  problem. 

The  estimator  derived  here  is  fundamental¬ 
ly  the  same  as  that  derived  by  Fairman  [8] . 
The  major  differences  are:  the  approach 
here  is  more  straightforward,  the  optimal¬ 
ity  of  the  estimator  is  more  explicitly 
exposed  and,  furthermore,  the  possibility 
of  nonuniqueness  of  P*(*)  is  discussed 
here. 

Zt  is  worth  mentioning  that  the  esti¬ 
mator  given  here  requires  lower  order 
matrix  inversion  than  the  standard  full- 
order  Kalman  filter  does  in  the  singular 
case;  thus  the  computational  efficiency 
is  improved.  This  estimation  procedure 
can  be  applied  similarly  to  amoothing  and 
predicting  problems  or  systems  with  colored 
noise . 
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In  this  correspondence  we  derive  the  true  optimal  reduced-order  state 
estimator  for  a  linear  system  with  singular  noise  covariance  matrix.  The 
results  reported  here  are  by  no  means  new.  Fairman  [7]  has  derived  the 
same  estimator  using  his  concept  of  “optimal  coordination."  However. 
Fairman’s  procedure  is  somewhat  indirect  and.  thus,  the  optimality  of 
his  hybrid  estimator  is  not  transparent. 

The  derivation  of  the  optimal  estimator  here  is  based  on  the  observa¬ 
tion  that  the  singular  stale  estimation  problem  can  be  reduced  to  a 
simple  constrained  optimization  problem  which  yields  a  Kalman-type 
estimator. 

The  correspondence  is  organized  as  follows.  The  problem  is  formally 
formulated  and  in  Section  III  its  solution  is  presented.  In  Section  IV  the 
state  estimator,  as  presented  here,  is  discussed. 

II.  Problem  Formulation 

The  standard  Markov- Gauss  model  is  considered,  i.e„  the  state  equa¬ 
tion  is  given  by 

x(k+\)~A(k)x(k)  +  B(k)u(k)  *-0,1,2.- (1) 
and  the  measurement  equation  is 


y(k) 

•  C(k)x(k)  +  v(k)  km  |.2.--- 

(2) 

where  x(-)£lS,',  u(- 

)£&'.  and  y(  )e9 lm.  x(0 ),  u(0).  u(l),- 

•.  c(l). 

t;(2),*  •  • ,  are  independent  Gaussian  random  vectors  with  the  following  1 

statistics: 

f-J 

£[x(0)]-x 

2  cov]x(0)]-2 

(3) 

£[u(i)]-0 

E[u(i)u'(j)\myil(i)SO-j)  Vi.j 

(4) 

y 

£[c(i)]-0 

E\ o(i ) v\j )] -  F„( i ) i 5( i  — j )  Vi,  j 

(5) 

£(u(i)t)(/)|-0 

£[u(/)x'(0)]  — 0  £[u(/)x'(0)]  —  0 

(6) 
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Abstract — The  problem  of  reduced-order  optimal  state  estimation  for 
linear  systems  with  singular  noise  covariance  matrix  b  studied.  It  b  shown 
that  the  optimal  estimator  b  somewhat  different  from  the  Kalman  filter. 

The  state  ratlamtnr  problem  in  the  singular  case  can  be  cast  as  a 
coastndned  opd ablation  problem.  Solving  thta  optimization  problem  yields 
the  truly  optimal  estimator.  The  estimator  derived  here  b  of  the  form  of 
the  hybrid  estimator  of  Fairman  [7).  However,  the  derivations  here  are 
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where  v'(  j)  denotes  the  transpose  of  the  vector  c(  j).  It  is  assumed  that 
F„(i)  are  nonnegative  definite  matrices  with  rank  m-m,.  where  m,  <  m. 
In  this  case,  the  standard  Kalman  filter  solution  involves  the  inversion  of 
a  matrix  which  may  be  singular.  Furthermore,  as  shown  in  [1).  an 
observer- estimator  of  order  ti-m,  may  be  constructed  which  performs 
as  well  as  the  higher  order  state  estimators. 

Thus,  our  objective  is  to  construct  the  optimal  n-mt  state  estimator. 
With  no  loss  of  generality  [1],  we  assume  that 


I.  Introduction 

The  problem  of  state  estimation  with  partially  noise  corrupted  mea¬ 
surements  is  of  practical  importance  since  it  is  often  the  case  that  certain 
measurements  an  significantly  more  accurate  than  other.  This  intro¬ 
duces  the  necessity  to  construct  the  state  estimator  assuming  a  singular 
(almost  singular)  noise  covariance  matrix  (henceforth,  referred  to  as  the 
singular  estimation  problem.)  Furthermore,  it  is  well  known  that  if  the 
measurement  noise  is  colored,  the  estimation  problem  can  be  formulated 
as  a  singular  estimation  problem  [1],  [2],  Surprisingly  enough,  with  all  the 
attention  given  to  this  problem,  standard  text  (e.g.,  [11,(2])  and  an 
abundance  of  papers  (e.g.,  [3]-(3D  present  a  variety  of  suboptimal  rather 
than  optimal  solutions  to  this  classical  problem. 


where  and,  thus,  the  covariance  matrix  of  v(k)  can  be 

written  as 

m  —  mx  m, 

•'(*)-[  Fci(*)  ;  0]  )m-mt  (5b) 

6  ' 6J  }m, 

where  V„t(k)  is  strictly  positive  definite.  It  is  simple  to  see  that  there 
exists  a  nonsingular  matrix  Q{k)  such  that  the  transformation  Z(*)- 
Q(k)x(k )  yields  the  following  state  and  measurement  equations 


Z,(*  + 1) 

*i(*  +  6 


^n(fr)  .  'fu(^)  Z,(*) 

,^2l(*)  !  ^22(*)  I  Zj(V) 


*1  (*) 

+■ 


«(*)  (6) 


>■,(*)]  r <?,,(*> :  cn(k) |[ z,(*>  '„,(*) 

....  -  - - + - 


l  0  .  Cn(k)  J L  Z2(  A )  j 

i(*)-C(*+D^(*)(?-|(*) 

i(*)-(?(*+U£(*) 

C(k)~C(k)Q-,(k). 
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and  where  Z,(-)£9 1"  and 

the  matrices  A,  3,  and  C  are  partitioned,  accordingly.  Furthermore,  we 
assume  with  no  loss  of  generality,  that  C(k)  is  of  full  rank  and,  thus, 
Cn(k)  is  invertible.  This  implies  that 

Zi(k)-C^(k)yt(k)  (8) 

and,  thus,  in  the  state  model  (6)  and  (7)  only  Z,(  ■ )  must  be  estimated.  To 
emphasize  this  point,  we  rewrite  (7)  as  two  equations 

Zx(k+\)-Alx(k)Zx(k)+Al2(k)Z2(k)  +  Bx(k)u(k)  (9) 

and 

Z2(k+\)-A2X(k)Zl(k)+A22(k)Z2(k)  +  B2(k)u(k).  (10) 

The  filtering  problem  is,  thus,  that  of  finding  the  optimal  Zx(k+  l/k+ 
1)  subject  to  the  dynamic  equation  (9)  and  the  constraint  in  (10).  Note 
that  the  unmeasurable  quantities  in  (9)  are  Zx(k)  and  u(k).  These 
quantities  are  constrained  by  equality  (10).  The  vector  Z2(k)  can  be 
handled  as  a  deterministic  input  in  the  Kalman  filtering  problem. 

In  the  next  section,  we  derive  the  optimal  observer  for  Zx(k),  where 
the  performance  measure  is  the  trace  of  the  error  covariance  matrix. 

III.  The  Optimal  State  Estimator 
Defining  the  variable  S(k ) 

S(k)-Zx(k)-P(k)Z2(k);  F(  *)€«.<"- "">*«" ,  (11) 


we  obtain  from  (7),  (9),  and  ( 10), 

S(*+  l)-F(k)S(k)  +  G(k)Z2(k)  +  M(k)u(k)  (12a) 

yx{k)-H(  k)S(k)  +  N(k)Z2(k)  +  vx(k)  (12b) 

where 

F(k)-Axx(k)-P(k+l)Alx(k)  (12.1) 

G(k)-Ax2(k)-P(k+l)A22(k)  +  F{k)P(k)  (12.2) 

M(k)-Bx{k)-P{k+l)B2(k)  (12.3) 

H(k)-Cu(k)  (12.4) 

N(k)-Cn(k)  +  Cxx(k)P(k).  (12.5) 


We  note  that  P(k)  can  be  viewed  as  the  Lagrange  multiplier  in  the 
standard  constrained  optimization  problem.  Now,  the  problem  of  esti¬ 
mating  Zx(k)  is  replaced  by  that  of  estimating  S(k)  given  the  measure¬ 
ments  (>',(1),>'I(2), -  •  • ,  y,(k)}.  Obviously,  from  the  definition  of  S(k)  in 
(11)  we  have 

S(k/k)~Zx(k/k)-P(k)Z2(k )  (13) 

and 

V^k/k)-Vt(k/k)  (14) 

where 

S(k/k)-S{k/k)-S(k) 

Zx(k/k)-Zx(k/k)-Zx(k) 

Any  unbiased  estimator  of  S(k)  is  given  by  the  (n-m^th  order  filter 

i(*+l/*+ !)-(/- Af(k  +  l)//(*+l)]/,(k)i(*/*)  +  8r(k-t-l) 

•[*(*+  !)-*(*+ l)Z2(*-M)-//(*+l)C(*)Z,<*)]  (19) 

S(k+\/k+\)-[!-K(k+\)H(k+l)]F(k)S(k/k) 

-l/-K(k+l)H(k+l)]Af(k)u(k)+K(k  +  l)vx(k+\).  (19a) 

To  obtain  the  optimal  estimator,  we  have  to  minimi«  the  trace  of  the 


error  covariance  matrix 

Kj(*+l/*+l)-{/-*(k+l)//(*  +  l)ir(k-H) 

[I-K(k+\)H{k+\)\'  +  K{k+\)V0)(k+\)K{k+\)  (20) 

where 

r(k+l)-F(k)l'i(k/*)r(Ar)-t-Af(*)Ky(*).t#,'(k)  (21) 

with  respect  to  (Af(k+  1),  P(k+  1)].  We  note  that  K(k  +  1)  plays  the  role 
of  the  Kalman  gain,  whereas  P(k+  1)  is  the  Lagrangian  of  the  opumiza- 
tion  problem.  Furthermore,  we  note  that  V(k  +  1)  is  the  one-step  predict¬ 
ion  error  covariance  matrix  Kf(  k  +  1  /k ).  Minimizing  tr  ( Fj(  k  +  l  /k  +  I )] 
yields 

*•(*+  l)-r*(k+  l)H’(k+  l)R-'(k+  1)*  (22) 

where  R(  • )  is  the  positive  definite  matrix 

R(k+l)“//(k+l)r(Ar  +  l)H'(k+  1)+  F<1(/(r+  1).  (22a) 

The  optimal  P(k  +  1)  is  denoted  by  Pm(k+  1).  R*(-)  and  T*(-)  in  (22) 
denote  R(  )  and  T()  of  (22a)  and  (21),  respectively,  expressed  as 
functions  of  the  optimal  P‘(k+  1).  Optimization  with  respect  to  P(k+  I) 
yields 

P*{k  +  \)e9*(k+\)  (23«) 

where  9*(k+  I)  is  the  set 

3>*(*  +  l)- {/>gi5ic-'»i)»|:  P\,(k)m\2{k)}  (23b) 

A,( k) k )FS( k/k)A’2x( k )  +  ij( k )K„( k )i;(  k  )  ( 23c) 

\2{k)-Axx{k)Vs(k/k)A'2x(k)  +  Bx(k)Vu(k)8i(k).  (23d) 

The  relations  in  (23)  are  vital  to  the  understanding  of  the  optimal  filter 
and.  thus,  merit  some  discussion.  Since  9L(A,(k))c"9l(  A2(k )),  where 
91  ( ,4)  denotes  the  null  space  of  A,  the  following  observation  is  made. 

Observation :  9*(k  +  I)  is  not  an  empty  set.  Furthermore,  if  '?*( k  +  1) 
is  not  a  singleton  (A,(k  j-singular),  any  member  P(k+  l)e9*(k+  1) 
yields  the  same  filter  performance. 

The  invariance  of  the  performance  over  the  set  9*(k+  1)  can  easily  be 
checked  via  consideration  of  the  error  covariance  matrix  as  in  (25) 
below. 

According  to  the  value  of  Ax(k),  the  following  special  cases  are  of 
interesL 

Case  1:  A|(k)— 0.  Note  that  in  this  case  A2(k)«0  and,  thus,  ’?•(*  + 
1)  —  '"'i'"'.  This  case  is  possible  if  (10)  does  not  contain  any 

information  pertaining  to  the  estimation  of  Zx(k).  For  example,  if 
^2i(k)-0  and  B2(k)  —  0,  etc. 

Case  2:  A,(k)>0  (A,(k)  is  a  singular  nonzero  matrix.)  In  this  case, 
only  some  components  of  Z2(k)  contain  information  on  (u(k),  Z,(  A )). 
In  this  case,  the  transformation  Q(k)  discussed  in  the  previous  section 
can  be  defined  so  as  to  isolate  only  those  elements  in  Z2(k)  which 
constitute  a  constraint  on  (u(k),  Z,(k))  and.  thus,  the  Lagrange  multi¬ 
plier  P(k)  that  should  be  considered  is  in  9t<a~r>’'  where  r  is  the  number 
of  such  constraints  r<m,.  Alternatively,  any  member  in  9*(k+  1)  can 
be  used  in  the  filter  realization. 

Case  3:  A((k)>0.  In  this  case  P*(k+  I)  is  uniquely  given  by 

f»*(*+l)-A2(*)Af‘(*).  (24) 

Thus,  the  optimal  filter  equations  are  given  by 

5(  k  + 1  /k  +  1 )  -  [  I - *•(  k  A- 1  )H(  k  +1 )]  f(  *  )S(  k/k  ) 

+  **(  k  + 1 )[  y ,( *  +  1 )  -  \»(  k  +  1  )Zj(  k  +1 ) 
-//(*+  l)G*(*)Zj(*)j  (25a) 

(25b) 


Z  ;k+l/k+l)«S(k  +  1/A+  l)-f  F*(k+  l)Z2(t+  1) 
Z,(*+l)-Ca'(*+l)>-2(*+1) 


(25c) 
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V2*(k+l/k+ !)-[/- K*{k+  \)H(k+  l)]r*(*+l) 

-  r*(*+ i)-r*(*+  \)H\k+  i)[//(*+i)r*(fc+i) 

■/#'(*+ l)+l'(*  +  l)rltf(*+l)r*(*+l) 

-  [(r*(*+  1))“'  +  H'(k+\)V0(k  +  l)H(k  +  l)] _1 


where  the  equality  -  is  meaningful  only  if  the  inverse  exists. 

r\k+i)mr(k)vj(k/k)F*(ky+M(k)Ya(k)Ar(ky 

-[/  :  - P*(k  +  \)][A(k)v;(k/k)AXk) 

+  BW  W)BU)}[-p.'k+l) 


,  Vf(k/k)  '  O' 

v!(k/k)- 

o  ;  o. 

due  to  the  fact  that  Z2  is  exactly  measurable. 

Remarks: 

1)  Comparing  the  error  covariance  in  (25d)  with  that  obtained  for  an 
arbitrary  /*(•),  we  observe  the  same  expression  for  V(k+  \/k+  1)  as 
function  of  !*(*:+ 1).  The  difference  is  that  r*(k  +  1)  of  (25e)  satisfies 
r*(*+l)<r(*+l)  where  T(Ac  + 1)  obeys  (25e)  with  nonoptimal  P(). 
(See,  for  example,  expression  (36)  of  (3].) 

_  2)  r*(*+  1)  of  (25e)  represents  the  covariance  of  the  prediction  error 
Z,(*+  \/k)  as  can  be  easily  proved.  Thus,  our  optimization  may  be 
viewed  as  two-step  optimization: 

Step  I :  finding  the  optimal  predictor. 

Step  2:  finding  the  optimal  filter. 

The  optimal  gain  Km(k+  l)  is  given  by 

K*(fc+l)-r*(*+0«r(*+l) 

[H(k+  1)T*(*  +  \)H'(k  +  1)+  yw>(k+  l)]-1  (26a) 

and  the  La  gran  gi  an  satisfies 

P‘(k  +  l)e^*(Jfc+l)  (26b) 

where  9*(k+ 1)  is  given  in  (23).  Equations  (25)— (26)  are  initiated  by  the 
values 

5(0/0) -£(  5(0)]  or  i,(0/0)-£(Z,(0)] 

F*(0)-0 

^,(0/0)-  yZi(  0). 

Remarks: 

1)  If  all  the  measurements  are  noise  corrupted,  i.e.,  m,- 0,  our  filter 
reduces  to  the  standard  Kalman  filter. 

2)  If  all  measurements  are  noise  free,  i.e..  m,  -m,  our  filter  performs 
exactly  as  the  “optimal  minimal-order  observer”  of  (2j  as  can  be  seen 
from  the  comparison  of  our  error  covariance  matrix  [(25d)-(25e)j  to 
theirs  ((42)  in  [4|] 

The  optimal  state  estimator  of  (26)  is  depicted  in  Fig.  1. 


Fig.  1.  The  reduced -order  optimal  estimator. 


/>(•),  i.e.,  Pm{-).  Note  that  the  innovations  are  white  regardless  of  the 
choice  of  P(  • ),  which  led  to  the  erroneous  conclusion  that  the  optimality 
is  achieved  by  considering  Z2()  to  be  a  deterministic  input  and  subse¬ 
quently  ignoring  (10). 

Finally,  it  should  be  noted  that  in  constructing  an  optimal  predictor  or 
an  optimal  smoother,  the  same  procedure  as  presented  in  Section  III 
may  be  utilized.  Furthermore,  the  idea  of  solving  the  singular  state 
estimation  as  constrained  optimization  problem  extends  straightfor¬ 
wardly  to  the  continuous  time  case. 
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IV.  Discussion 

As  stated  earlier,  the  state  estimator  derived  here  is  fundamentally  the 
same  as  that  derived  by  Furman.  The  major  differences  are:  1)  our 
approach  is  direct  and,  thus,  the  optimality  of  the  estimator  is  guaran¬ 
teed;  and  2)  in  Fairman's  paper,  a  filter  equivalent  to  ours  is  presented, 
but  the  possibility  of  nonunique  /’*(■)  is  overlooked  (cases  l)-2)  of  the 
previous  section). 

The  reader  might  wonder  about  the  results  here  as  compared  to  those 
found  in  literature.  The  difference  is  in  the  optimal  choice  of  the  matrix 
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Spectral  Analysis  of  Previewing  Controllers 

ELI  FOGEL  and  KEYING  McGILL 


Abstract — Analytical  wqwhi—  far  the  perform— m  of  apW  and 
anbopttaMi  control  poHdco  in  trajectory  tracking  pwhlani  are  derived. 
Special  attendee  la  devoted  to  the  finite  preview  proNw,  In  the  dme- 

1 - *n-t  ‘"ifWtr  •‘ratlin  rear  rmmtpmuhiaini  nf  Ihi  put . wan  IniBrra 

for  finite  energy,  finite  power,  and  riationary  wnrhnetlr  trajectories,  all 
with  the  one  spectral  density,  is  rarahHahti.  Using  tfala  rmnupnndian 
the  parforwance  of  different  control  pohdoa  is  stadkd  hi  the  freqncncy 
domain.  Slaialarina  remits  demonstrate  the  faefflty  of  the  theory  of  tWa 


I.  Introduction 

One  of  Bellman’s  major  contributions  to  optimal  control  theory  is  his 
principle  of  optimality  (2J.  This  principle  implies  that  to  optimally  steer  a 
system  along  a  trajectory  the  entire  trajectory  must  be  available.  Unfor¬ 
tunately.  moat  practical  controllers  can  only  preview  the  jw»ww«»t. 
future  of  the  trajectory  and  must  rely  on  statistical  information  to 
characterize  the  trajectory  beyond  their  preview.  A  common  example  of 
a  previewing  controller  is  the  driver  of  an  automobile,  and  we  refer  to  a 
previewing  controller  aa  a  driver,  and  to  a  driver  with  complete  trajec¬ 
tory  information  aa  a  prescient  driver. 

While  control  policies  for  previewing  controllers  have  been  presented 
in  the  literature,  little  attention  has  been  paid  to  evaluating  their  perfor¬ 
mance.  Tomizuka  and  Whitney  [7],  [8]  have  derived  the  optimal  driver 
under  the  assumption  that  the  trajectory  is  generated  by  a  known  linear 
system  driven  by  Gaussian  noise.  They  also  give  a  ride  of  thumb  for 
calculating  the  preview  length  which  results  in  good  performance.  Miller 
[4]  has  presented  a  simple,  wed- performing  driver  with  no  a  priori 
knowledge  of  the  trajectory’s  statistics.  Fogel  (3]  has  evaluated  the 
difference  between  the  control  values  of  the  optimal  driver  and  the 
optimal  prescient  driver.  However,  until  now,  simulation  has  been  the 
only  way  to  evaluate  the  performance  of  any  of  these  driven. 

The  purpose  of  this  paper  is  to  derive  analytical  expressions  for  driver 
performance.  In  Section  II  arbitrary  trajectories  are  considered  in  the 
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time-varying  case.  An  integral  expression  for  the  performance  is  derived 
which  depends  explicitly  on  the  trajectory  and  the  preview  length  hut  is 
independent  of  the  course  of  the  plant  state  for  a  broad  class  of  linear 
driven.  This  result  employs  the  concept  of  performance  penalty  which 
waa  used  by  W emersion  to  solve  the  related  optimal  pursuit  problem 
(10).  Aa  an  example,  this  result  is  used  to  optimize  a  driving  policy. 

In  Section  III  the  expected  performance  in  following  a  stochastic 
trajectory  of  known  spectral  density  is  considered.  The  problem  is 
restricted  to  the  infinite-duration  time-invariant  case  to  allow  analysis  in 
the  frequency  domain.  The  key  result  is  that  the  expected  performance 
in  following  a  stochastic  trajectory  may  be  calculated  by  considering  a 
much  simpler  deterministic  trajectory  with  the  same  spectral  density. 

Using  this  result,  the  performances  of  the  optimal  prescient  driver,  of 
Miller’s  driver,  and  of  Tomizuka's  driver  are  calculated  in  the  frequency 
domain.  It  the  spectral  density  can  be  identified  as  that  of  the  output  of 
a  linear  system  driven  by  white  noise,  the  performance  can  be  related  to 
the  solution  of  algebraic  Riccati  equations. 

The  results  in  Sections  II  and  111  are  stated  for  continuous  time.  The 
corresponding  discrete  time  relations  are  reported  in  the  Appendix.  In 
Section  IV  computer  simulation  results  are  discussed.  These  simulations 
demonstrate  the  validity  of  the  theory  and  provide  some  insight  into  it 
Section  V  concludes  the  paper. 

II.  The  Driving  Problem  and  Performance  Evaluation 

The  difference  between  a  controller’s  actual  performance  and  the 
performance  of  the  optimal  prescient  controller  may  be  thought  of  as  the 
controller’s  penalty.  The  optimal  prescient  performance  is  known  from 
the  optimal  tracking  problem,  and  a  remarkably  simple,  but  less  well- 
known,  expression  for  penalty  has  been  presented  by  Weraersson  [10].  In 
this  section,  these  ideas  are  applied  to  the  time-varying  driving  problem. 

The  plant  to  be  controlled  is  described  by  its  state  equation 

x(l)mA(i)x(f)+B(t)u(t);  x(r0)-jc0  (2.1a) 

where  x{l)eRx,  u(t)GX',  and  A(i)  and  B(l )  are  matrices  of  ap¬ 
propriate  dimensions.  The  system  matrices  are  assumed  to  be  time 
varying  throughout  this  section  although  for  notational  convenience  the 
time  dependence  will  often  be  suppressed. 

The  control  policy  is  assumed  to  be  of  the  form 

«(>i.((,r(l),{yr(T):r<(l+r)}) 

where  Tr-  [yT( f):  rs[i#, tf\,  yr(r)e/l*}  is  the  trajectory  to  be  fol¬ 
lowed,  and  T  is  the  preview  length.  Linear  control  policies  of  the  form 

u(l,x(f).rJ.)--K(r)x(r)+  ('’H(t,r)yT(r)dr  (2.2) 

J>o 

will  be  denoted  by  the  pair  ( X(t),  HO,  r)).  HO,  r)  can  be  considered  the 
impulse  response  matrix  of  a  noncausal  “trajectory  filter,"  and  the 
preview  constraint  means  that  H(l,  r)-0  Vt>i+  T.  The  state  trajectory 
of  the  plant  for  a  particular  u(-),  r0,  x0,  and  Yr  will  be  denoted  by 

X(t0.x0,Yr,  u())m{xO;  i0,x0,rT.  «(•)):  ref/,.//]}. 

The  control  objective  is  to  minimize  the  performance  index 

f''Ht,xO;t0,x0.Yr,  «<•)>.  yr,u())dr 

(2.3a) 

where 

L(<,  xO),  YT,  “(•))"  || r )-yr(r)||^(,)+ 1| «(r,  x(i),  Fr)||i(() 

(2.3b) 

3>(r)-C(i)*(r);  »(!)€*♦  (2.1b) 

and  £(r)>0  and  A(r)>0. 
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The  characteristic  which  ^T'gv**^**  the  driving  problem  from  tbs 
familiar  tracking  problem  of  optimal  control  theory  (see,  e.g.,  [1])  is  that 
only  •  subset  of  Yf  is  available  to  tbs  controller.  If  tbs 

problems  are  identical  and  the  controller  will  be  called  prescient. 

The  optimal  prescient  control  policy  is  known  from  the  tracking 
problem  [1]  to  be 

u°(t.x(t),  rT)--R-'B'r(t)x(t)+R-'B'i(t)  (2.4a) 

where 

P(t)~-PA-A’P+PBR-'B’P-CQC;  /’(r/)-0  (2.4b) 
iU)-l-A’  +  I>BR-'B’]i-C'QyT;  {(/,)-0  (2.4c) 


This  idea  is  illustrated  in  Fig.  1.  The  proof  is  given  after  the  following 

corollary. 

Corollary  2.1:  The  penalty  paid  by  the  control  policy  «(•)- 
( JC°(t).  H(t,  r))  is  independent  of  *(/„.  yt<  “(  ))  “d  “  given  by 

W,o.^o.>’r.-(  ))-/'/(/'/I"('.T)-"<’(<.r)Jyr(0<frJ  dr. 

(2.8c) 

Proof:  In  this  proof  the  notation  x(t;  x0,  Yr,  u(-))  will  be  ab¬ 
breviated  to  x(t;  ■).  The  method  used  is  similar  to  that  of  Wernesson  in 
(10).  Let 

p(r)  * y(r,Jr(r;  ).r„ »•(-))-*'(/;  )F(r)x(f;  )-2{'(r)x(r;  •)+«/) 


Notice  that  ((/)  is  independent  of  X(t0,x0,  YT,  u°(  )X  and  that  Pft)  is 
independent  of  both  X(t0,x 0,  Yr,  «*(•))  sad  Yr.  **(•)  is  of  the  form  of 
(22)  and  will  be  denoted  «0(  )-(/r°(/), /f#(r.  r)\  where  *°(/)- 
R~‘B'P(t)  and  /f4(t,r)  is  the  impulse  response  of  the  trajectory  filter 
whose  state  equation  is  (2.4c).  This  filter  is  purely  anticipatory,  that  is, 
H°(t,  T)«*0  Vr<r.  The  performance  index  for  this  driver  is  presented  in 
a  new  way  in  the  following  lemma. 

Lemma  I: 

+  r'MD-CP-'iOHm^dt.  (2.5) 

Thus  the  optimal  performance  index  is  the  sum  of  a  term  which  can  be 
made  to  vanish  by  a  proper  choice  of  x0  and  a  term  which  is  indepen¬ 
dent  of  X(t0,  x  0,yT,  «*(•)). 

Proof:  The  proof  employs  the  technique  used  by  Wemcrsaon  in 
(10).  The  familiar  expression  for  J(t0,  x0,  Yr,  u°(  ))  is,  from  (U 

/(r0,x0,rr,u°(  ))-xiF(/0)x0-2xi«/0)+f(r0)  (2.6a) 


From  (2.6),  p(<0)“  A<o-  *0*  Yr>  “°(  )>  “d  p(»/)-0,  and  so  from  (2.8a) 
and  (22a) 

J,JL 'o. *o.  *r .»())- 1 '1 U t.  x(r.  •),  Yt,  u( ■))  +p(r )J  dr. 

Differentiating  p(r),  substituting  the  appropriate  expressions  from  (2.1a), 
(2.4b),  (2.4c),  and  (2.6b),  and  combining  with  (23b),  establishes  Lemma 
2  The  corollary  follows  easily  from  Lemma  2  and  (22). 

Example — The  Optimal  Limited  Preview  Driver:  To  demonstrate  the 
power  of  Lemma  2  the  optimal  limited  preview  control  policy  with  no  a 
priori  knowledge  of  the  trajectory  characteristics  will  now  be  derived. 
Assuming  a  control  policy  of  the  form  (X( /),  ff(r,r)),  the  penalty  is, 
from  (28b)  and  (22), 

rr,(Ar, ff»- Jf''j  |[ *(/)-*"(/)]*(/; r0.x0, Yr.(K, h)) 

*  ,r)-/f«(f ,r)lMT)dT 

-  f'  H^(t,t)yT{r)dr\  dt.  (2.10) 


where 

S(0-e’8R-'B'(-yi.<2yT;  f(r,)-0.  (2.6b) 

Let 

r(r)-y(/,/*-'(/)«(r).rr,u0())-f(r)-<'(/)F-'(r)€(«).  (2.7) 

Notice  that  from  (2.6a) 

y(i)m  tninS(t,x,YT,u0(-)) 

M 

and  so  since  from  (23)  J(t(,x,  Yr,  u(-))-0  Vx  (assuming  Tr  is  well 
behaved  at  tf),  it  is  dear  that  y(r)-*0  as  r-M^even  though  P~'(tj)  is  not 
defined.  Therefore,  from  (26)  and  (2.7), 


With  no  e  priori  information  on  ( yT(r ):  rx+T)  at  time  instant  t,  a 
‘‘reasonable"  choice  of  the  control  policy  is 

K{t)~K°0) 

/f(r,r)-(  *<t+T  (2.11) 

lO  r>r+T 


In  statistical  terms,  assuming  that  the  trajectory  is  a  martingale  pro¬ 
cess  is  compatible  with  the  assumption  of  no  a  priori  information  oo  the 
trajectory  characteristics.  Namely,  we  assume 


*IM*)/«,+t]-0  Vt  >r+T  (2.12) 


where  o,+T  is  the  e-algebra  induced  by  the  measurements  {y(e)—r<t- f 
T).  Now  using  (210)  one  obtains 


*• 

Differentiating  (27)  and  substituting  the  appropriate  expressions  from 
(24b),  (24c),  and  (26b),  establish  the  lemma. 

The  above  ideas  are  extended  to  suboptimal  control  policies  in  the 
following  lemma  which  defines  the  performance  penalty  paid  by  a 
control  policy  and  states  the  remarkable  expression  for  its  evaluation 
which  was  first  presented  by  Weraersson  in  (10). 


/J,J|(A(r)-K*(r)]x(r) 

>-*•(, )],«) 


Lemma  2: 

Jpjtp, x„  Yr,  «(•))  -  /(»<,.  *o.  Yr. «())-/(/„.  x„,  Yr,  «*(•)) 

(2.8a) 

■  /  ’'ll  u(r,  x(t ;  x0,  /„ ,  Yr,  u(  ■ )),  Yr  ) 

-e°(r, x(f,  x0,/0,  Kr, *(•)),  Yr)\\\dl.  (2.8b) 


+  'r)[/f(r,T)-/fO(r,  T))yr(T)  dr)'* 

<!'5> 
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where  tbs  identity  E[  ff(t)di]mE  fElf(t)/oi+T\dt  bee  been  used. 
Revoking  the  martingale  Assumption  in  (2.12)  the  second  term  in  (2.13) 
nullifies  to  yield 


min  H°(t,r)yT(r)dri  dt 

1  1  Ji0  U-'miaO+r./,)  T  II 


where  the  optima I  control  policy  is  (2.1 1).  Note  that  the  martin  gale- type 
assumption  amounts  to  little  statistical  information  of  the  stochastic 
process  (yr(-)}-  Further  justification  to  the  choice  of  the  policy  (210)  is 
given  in  the  remarks  following  Example  2  of  the  following  section. 

Hie  results  obtained  above  are  of  academic  interest  However,  their 
generality  does  not  allow  their  application  to  the  solution  of  practical 
problems  such  as  the  design  of  the  preview  length. 

To  obtain  further  insight  into  the  finite  preview  problem,  we  attach 
some  structural  properties  to  the  trajectory  to  be  followed.  Particularly, 
we  consider  the  infinite  duration  problem,  with  time-invariant  plant  and 
spectral  information  on  the  trajectory. 


III.  SracntAL  Analysis  or  the  Performance  Index 

Since  the  actual  trajectory  is  rarely  available  a  priori ,  it  is  important  to 
be  able  to  evaluate  a  driver’s  expected  performance  over  an  ensemble  of 
possible  trajectories.  This  section  considers  a  simple  but  useful  class  of 
ensembles,  namely  stationary  stochastic  processes  characterized  by  spec¬ 
tral  density.  The  problem  is  restricted  to  the  infinite  duration,  time 
invariant  case  and  is  analyzed  in  the  frequency  domain.  The  main  result 
is  that  the  performance  depends  upon  the  spectral  density  in  the  same 
way  for  stochastic  as  for  deterministic  processes.  This  result  is  used  to 
calculate  the  expected  performance  of  the  optimal  prescient  driver,  of  a 
driver  which  ignores  the  trajectory’s  spectral  density  (Miller’s  driver), 
and  of  a  driver  which  uses  knowledge  of  the  trajectory's  spectral  density 
optimally  (Tomizuka’s  driver). 

The  following  assumptions  are  made  in  this  section. 

Assumption  (Al)  A,  B,  C,  Q,  and  R  are  constant  matrices. 

Assumption  (A2)  The  plant  is  completely  controllable  and  observable. 

Assumption  (A3)  r,-  —  oo  and  tjm  +  oo. 

Assumption  (A4)  xo-0. 

Assumption  (AS)  The  control  policy  is  of  the  form  «(•)— ( K,  H(l-r)), 
where  K is  constant,  A-BK is  stable,  and  /“<J|/f(n)ll Jdr|<  oo. 

In  this  section  r0  and  x0  will  be  dropped  as  arguments  of  J  and  x. 

The  following  three  classes  of  trajectory  are  considered  in  this  section. 

I.  Finite  energy,  i.e.,  <*<  oo. 

II.  Finite  power,  i.e,  <||  Yr II 2>  <oo,  where 

<r>*  &»/->>*• 

III.  Zero-mean  finite-variance  wide-tense  stationary  stochastic  pro¬ 
cess,  lo^  £(yr(r)}-0;  E{yT(t+r)yrO))mBy(r)<<x,. 

The  performance  index  J(Yr,  u(  ))  must  be  defined  differently  de¬ 
pending  on  the  class  of  Yr  as  shown  in  Table  I.  Table  I  also  defines  the 
spectral  density  of  Yr  tor  each  class.  Using  the  appropriate  definitions  of 
performance  index  and  spectral  density,  the  following  theorem  applies  to 
all  three  classes  of  trajectory. 


Fit.  I.  Jpmm  as  Um  rangy  out  ot  a  lyttsa  dhvm  by  the  trajectory. 


Theorem  1:  Under  the  assumptions  (Al)  through  (AS),  the  perfor¬ 
mance  index  of  a  control  policy  u(-)  following  a  trajectory  YT  whose 
spectral  density  is  Sy(u)  is 

J(Yr,u())- ^tr/_^(i;/l(«))*f(«)4!/2(«)d«  (3.4) 

where  (Sy/2(a)ySy/1(u)  Sy(u),  the  gain  function  f(«)  — 
Gm(u)RGZ(u)+G,(u)QGf(u),  and  GJ.u)  and  Gj(u)  are  defined  in  the 
proof. 

Proof:  Under  the  assumptions  ot  time  invariance  and  stability  made 
in  this  section,  stable  transfer  functions  exist  between  yT  and  u  and 
between  yT  and  yr-.».  Calling  these  transfer  functions  <7„(u)  and 
G.(u)  respectively,  it  is  a  straightforward  exercise  in  harmonic  analysis 
to  establish  (3.4)  for  each  class  of  trajectory.  Notice  that  assumption 
(AS)  is  necessary  to  eliminate  any  c&otribution  to  J  from  the  initial 
condition  x0  tor  trajectories  of  class  I. 

Remarks:  This  theorem  is  important  because  it  allows  the  perfor¬ 
mance  of  drivers  to  be  examined  in  the  frequency  domain,  and  because 
it  allows  the  performance  index  of  a  driver  following  a  trajectory  of  class 
II  or  III  to  be  calculated  by  evaluating  the  performance  index  of  the 
driver  following  a  trajectory  of  class  I.  In  the  following  evmptra, 
trajectories  of  class  III  will  be  modeled,  as  is  common,  as  the  output  of 
the  following  stable  time-invariant  “trajectory  generator”  driven  by 
white  noise 


Xx(t)mATxT(t)  +  Btw(i);  xr(  —  oo)”  0  (3.5a) 

fr(0“CyJty(i)  (35b) 

where  $,(<■>)•/.  The  corresponding  class  I  trajectory  of  the  tame 
spectral  density  is  the  impulse  response  of  the  trajectory  generator, 
namely 


,,  fO  r<0 

') "  (  CTeAr‘BT  t  >  0  (3-5c) 

Example  I:  The  Optimal  Prescient  Driver:  It  is  known  from  [I]  that 
under  assumptions  (Al)  through  (A3),  the  optimal  prescient  control 
policy  is  u°(-)-(JC°,  H°(t-r)),  where 


TABLE  I 


Class 

I.  Finite  Energy 

II.  Finite  Power 

III.  Stochastic 

Performance 

Index 

r  «o 

/-  f  Ldt 

ae 

/-<L> 

S-£«L» 

(3.1) 

Spectral 

Density 

/  MO/H'+T)  ***■$(“) 

<YrO+r)Yr(r)>*^(*>) 

£(0'r(,+T).»'f(T)>  }**S(*>) 

(3.2) 

•'-00 

where  X(t)**X(m)  denotes  the  Fourier  transform  pair 

m  P  OO 

Ar°-R-,B'FI( 

(3.6a) 

*(«)-  f  XU)*"*' <* 

'-60 

(32a) 

*o-(,-T).(*-'*'«-A'<'-T,c'e  r>i 

(3  6b) 

X(t)m±  dm. 

(3.3b) 

1 0  f<r 

\-A-BR-'B’P„. 

(3.6c) 
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fit  is  the  steady-stale  solution  of  (2.4b)  sad  satisfies  the  Rices ti  equation 
given  in  (3.7).  Thus,  A  is  stable.  If  I'm  is  generated  by  (3.5)  the  optimal 
control  may  be  stated 

*<Ht.x(t).Yntt))--K0x(t)+R-'B'P l2XrO) 
where  P,  -  satisfies  the  Riccati  equation 

L  MJ  *22 

Q~-P'A'-A''P'+?'B0R-yB't?t-C0QC'  (3.7a) 

with 


'.-[2  Ar\'  *"[!]’  C.-lC:-CTl  0.7b) 

The  performance  index  of  the  optimal  prescient  driver  is  given  in  the 
following  lemma. 

Ltmma  3: 

•  | /- Cf.T '(->/- A')- 'C'QCgSy'M  <t*>-  (38a) 

Furthermore,  if  Yr  is  generated  by  (3.5),  then 

J{  Yr,  u#())-*i{  Pu-raf„Pu]  *r  (38b) 

where 

F„A'+A  Plt»-BR-'B'.  (3.8c) 

Proof:  Applying  Psrsevsl's  formula  to  the  integral  in  (25)  yields 
(3.8a).  Note  that  the  first  term  in  (X5)  is  zero  since  (3.6b)  guarantees 
f ( -  ao)-0  for  a  finite  energy  Yr.  (3.8b)  is  obtained  by  considering  the 
suboptimal  control  policy  «(•)— (JC°,  0)  foliowing  y,ift).  Clearly 
x(t; yw,(JCa.0))-0  and  «(i,x(»; Y,MHK\0».YIM)mO  Vr.  Therefore 
(2.8c)  and  (2.2)  together  with  (3.5c)  and  (3.6b)  yield 

V(  r«.(  K*,0))  -  /_°J/o"w,(,-r)y,*(r)  4r|^  dr 

A  r«.(  *•.<»)- 

The  optimal  performance  is  then  given  by  (2.8a) 

A  r,K.  -°)  ~A  Ytx.(x\0)) -/„(  y/je,(Ar°,o)). 

Substituting  expressions  (3.5c)  and  (3.6b)  into  the  penalty  and  subopti¬ 
mal  performance  yields  (3.8b).  Details  are  given  in  (5). 

Note  that  the  performance  calculations  are  made  over  the  enure 
interval  (-ao,  +«),  although  the  impulse  is  not  applied  to  generate  y,K 
until  r-0.  The  significance  of  this  result  is  that  it  gives  a  lower  bound  on 
the  performance  index  of  any  driver  for  the  infinite-duration  problem. 
Furthermore,  it  should  be  noted  that  for  dam  III  trajectories,  the  control 
(3.6)  may  not  yield  /( - )  as  in  (3.8)  for  a  particular  reehzatioo  of  the  input 
(w (r)).  However,  the  performance  of  the  lemma  will  be  observed  on  the 
average.  In  the  case  of  second-order  ergodicity,  the  actual  performance 
will  be  that  of  (3.8)  for  almost  all  realizations  of  the  white  input  to  the 
trajectory  generator. 

The  performance  penalty  of  suboptimal  pobdet  can  be  expremed  in 
the  frequency  domain  as  in  the  following  lemma. 

Ltmma  4:  Under  (Al)  through  (AS)  the  penalty  paid  by  the  control 
poHey  «(•)—(*•,  HO-r))  is 

±  (3.9) 


Proof:  This  result  follows  easily  by  applying  ParsevaTs  formula  to 
(2.8c).  This  lemma  will  be  used  in  the  following  examples  of  suboptimal 
drivers. 

Example  2 — Miller's  Driver  (4);  Miller  [4]  has  proposed  the  time- 
invariant  version  of  the  optimal  limited  preview  driver  with  no  a  priori 
knowledge  of  the  trajectory  characteristics,  namely  «*(-)— (AT0,  HM(t- 
r)),  where 

t<i+T  (3 .10) 

U  r  >/+T 

The  performance  penalty  of  Miller’s  controller  is  given  in  the  following 

lemma. 

Lemma  5: 

•u(jv.  «*(»- tr^  /_“  (i;/2(«>)* 

•  j|A -'B'eAT( -jut- A') ~ 1  C'Qd^Sy n(u) du.  (3.11a) 
Furthermore,  if  Yr  is  generated  by  (3.5),  then 

■U(  IV.  -*(•»-«*[  /VAr*.*AT',t+4]  Bt  (3.1  lb) 

where 

l»2JAr+/l'ri»a-  —  P'l2eAT8R~ ' B'eA  TPxl.  (3.11c) 

Proof:  (3.11a)  is  obtained  by  transforming  (3.6b)  and  (3.10)  and 
substituting  into  (3.9).  (3.11b)  is  obtained  by  using  (3.5c),  (3.6b),  and 
(3.10)  to  write  (2.8c)  as 

4-(  yr .“"())  -  /_~Jf  /0“"°( '  ~  T  W  T  >  ^  <* 

and  then  substituting  (3.5c)  and  (3.6b). 

Remarks:  In  the  frequency  domain,  lack  of  knowledge  about  the 
trajectory  characteristics  means  that  energy  must  be  considered  equally 
probable  at  all  frequencies,  i.e.,  that  5r(u)— /.  That  «*(•)  minimizes 
J^YtM  •))  for  S,(w)— /.  among  all  control  policies  of  preview  length 
T,  is  dear  if  Parscval’s  formula'  is  applied  to  (3.9)  to  give 

~trfJ^\H°U)fl,dt  +  ti  /_“B«(»)-W°(Olla<*. 

Example  3 — Tomiiuka’s  Driver  (7]-[9):  Tomizuka  and  Whitney  [7]- 
[9]  have  derived  the  optimal  driver  assuming  that  the  stochastic  trajec¬ 
tory  generator  as  given  in  (3.5)  is  known  to  the  controller.  Their  control 

law  is 

«r(t,)-u"(r  ,)  +  R-,B'eA7"F,2xr(«+T)  (3.12) 

where  xr(t*T)  is  the  best  estimate  of  the  state  of  the  trajectory 
generator  given  (yr(r):  rcr+T}.  «r(  )  is  the  best  estimate  of  u°(  ),  as 
seen  by  writing  the  last  term  of  (3.12)  as 

R-'»'eATPx2iTU+T)~  /"  H°{l-j)yT(r)d7 
Ji  +  T 

where 

y‘r(T)-Cre'<r»iI.(t+  T).  r>f+r. 

Lemma  6:  If  Cr  -/,  Tomizuka’s  driver  pays  the  penalty 

-W  Tr,  «r(  • ))  -  tr  ^  /_"  (  iy1(«))*)|R- 1 B  V A  r 

[(-7«/-A’)-,C(?-F„]||J(|iy,(w)dw.  (3.13a) 
Furthermore,  if  XT  is  generated  by  (3.5)  and  xr(t+  T)  is  measurable. 
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Fig.  2.  Tha  opUwal  prMriml  eoairolkr  fo Bowing  tka  impnb«  mpotm  of  tb«  trajectory 

fnaralor. 


ilm  regardless  of  CT 

W  ATr ,  «r())  -  B'T^TPllt^aBT.  (3.13b) 

Proof:  Since  xr(t+  T)~xr(i+  T)—yT(t+  T\  the  last  term  in  (3.12) 
may  be  included  as  a  delta  function  inside  the  convolution  integral  in 
(2J).  The  proof  is  then  similar  to  that  of  Lemma  3.  Details  are  given  in 

W 

Remark:  Applying  Parseval’s  formula  to  (3.9)  yields 

WrM)) 

mf-J  f~H°«-')xA*) 1‘* TH(t-')xT(T) rfr|V dr. 

dearly  in  this  case  the  optimal  choice  for  H(i-r)  is 

HT0-r)-HM(t-r)+S(t+T-r)  f"  M°{I-t)xt(.t)Rt. 

Jt+T 


IV.  Simulation  Results 

The  purpose  of  this  section  is  to  report  the  remits  of  computer 
simulation*  which  demonstrate  the  theory  developed  earlier,  and  to 
provide  some  insight  into  the  driving  problem  in  the  frequency  domain 
The  equations  used  here  are  die  discrete  time  versions  of  the  relations 
derived  above,  and  ate  given  in  the  Appendix.  The  plant  which  is  studied 
is  described  by  the  following  system  matrices: 


r  o.o 

1.0  1 

,4°*1 

l  -0.99 

1.99  J 

L  i.oj 

and  unless  otherwise  noted,  0-0.1  and  A-2. 

The  correspondence  between  the  stochastic  and  deterministic  prob¬ 
lems  is  illustrated  by  Figs.  2  and  3  for  the  following  trajectory  generator 

M-SJj  !&]  M?S]  <«» 

In  Fig.  2  the  trajectory  yT  is  the  impulse  response  of  the  trajectory 
generator,  in  Fig.  3  yT  is  the  response  of  the  trajectory  generator  to  a 
white  noise  sequence.  In  both  figures,  yf  is  the  output  of  the  plant  under 
optimal  prescient  control  Notice  the  similarities  between  the  frequency 
contents  of  the  two  trajectories  and  between  the  tracking  errors. 


Actual  driving  performances  were  computed  by  direct  simulation  for 
several  realizations  of  the  stochastic  trajectory  generated  via  (4.2).  Table 
I  compares  the  average  performances  observed  over  ten  runs  of  1000 
steps  with  the  expected  performances  calculated  from  (A3. lib)  and 
(A3. 13b)  for  Miller’s  driver  and  Tomizuka’s  driver,  respectively,  as  a 
function  of  the  preview  length.  The  averages  agree  quite  well  with  the 
expected  values. 

Several  general  trends  can  be  seen  in  Table  II.  Tomizuka’s  driver 
outperforms  Miller’s  driver  at  each  preview  length,  and  always  improves 
its  performance  as  the  preview  length  is  increased.  Miller’s  driver,  on  the 
Other  hand,  sometimes  performs  more  poorly  with  an  increased  preview 
length  (compare  JC—t  and  K— 9)  due  to  its  ignorance  of  the  trajectory 
spectrum. 

It  is  insightful  to  examine  the  gain  functions  T  of  the  various  driven. 
As  shown  by  (3.4),  T  indicates  how  heavily  the  trajectory  energy  at  each 
frequency  is  weighted  in  the  calculation  of  the  overall  performance. 
Thus,  designing  a  previewing  controller  may  be  thought  of  as  specifying 
a  T,  subject  to  the  preview  constraint,  which  is  does  to  f®  in  some  sense. 

r°  is  plotted  for  several  values  of  the  control  cost  R  in  Fig.  4.  As 
might  be  expected,  for  a  given  R  the  optimal  prescient  driver  is  better 
able  to  follow  the  low-frequency  components  of  the  trajectory.  Notice 
that  the  upper  limit  on  T*  is  Q,  which  corresponds  to  the  complete 
inability  of  the  driver  to  track  a  particular  frequency.  As  R  increases,  the 
range  of  untrackable  frequencies  grows. 

Tr  is  plotted,  for  several  values  of  preview  length,  along  with  the 
trajectory  spectral  density  for  two  different  trajectory  generators  in  Figs. 
3  and  6.  The  trajectory  generator  for  Fig.  3  is 

Ar-0.9  gr-l.  Cr-0.042  (4J) 

whose  spectrum  is  very  narrow.  As  a  consequence,  fr  is  very  dose  to  f* 
at  low  frequencies  where  the  trajectory's  energy  is  concentrated,  whereas 
it  is  far  from  T°  at  the  less  important  higher  frequencies.  On  the  other 
hand,  the  trajectory  generator  for  Fig.  6, 

Ar— 0.6  Jr-I.  Cr— 0.138  (4.4) 

has  a  fairly  wide  spectrum  which  forces  fr  to  fit  f°  over  a  wider 
frequency  range.  As  the  trajectory  spectrum  becomes  flat  (rfg-*0), 
Tomizuka’s  driver  becomes  identical  to  Miller’s  driver,  whose  Tm  is 
plotted  in  Fig.  7  for  several  previewing  lengths. 
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M  uter  i  Driver 


Tomizuka’s  Driver 


Note  that  if  the  trajectory  energy  is  concentrated  around  the  frequency 
*»— 12,  Miller's  driver  performs  better  with  preview  length  AT-  1  than 
AT -2,  a  phenomenon  observed  also  in  Table  II  at  AT- 8,  AT-9  for  the 
trajectory  4.2.  On  the  other  hand,  if  the  trajectory  spectrum  is  available, 
Tomizuka’s  driver  always  improves  its  performance  with  increase  in  the 
preview  length.  Finally,  it  should  be  noted  that  Tomizuka’s  driver  may 
perform  more  poorly  than  Miller's  if  the  actual  trajectory  spectrum 
differs  significantly  from  that  for  which  it  was  designed. 


V.  Concluding  Ruaus 

Expressions  for  driver  performance  have  been  presented  for  both  the 
time  varying  and  the  time-invariant  preview  control  problems.  Of  partic¬ 
ular  importance  is  the  result  that  a  driver's  expected  performance  over 
an  ensemble  of  trajectories  generated  by  a  linear  system  driven  by  white 
notes  is  just  the  driver's  actual  performance  following  the  impulse 
rasp  nan  of  the  trajectory  generator.  This  result  comes  about  because  the 
quadratic  performance  index  is  expressible  in  the  frequency  domain  and 
became  the  impulse  response  has  the  same  spectral  composition  as  the 


approach  of  this  paper  has  significance  beyond  the  perfor¬ 
ation  of  control  policies.  The  duality  of  the  linear  quadratic 
item  and  least  squares  estimation  indicates  that  the  tame 
s  be  etifacd  to  evaluate  the  design  of  suboptimal  estimators, 
i,  the  evaluation  of  the  "goodness”  of  a  finite  All  or  MA 
eating  an  ARMA  model  can  be  based  on  theee  ideas. 


The  problem  investigated  here  entails  closed  form  solutions  since  the 
trajectory  was  parameterized  by  time.  An  open  issue  which  deserves 
attention  is  the  study  of  self-paced  controllers  [61  which  are  more 
realistic  models  of  the  human  driver. 

Ammu 

Ths  Dbcxztz  Tna  Casi 

The  basic  relations  discussed  in  the  paper  are  listed  here  for  the 
discrete  time  driving  problem,  keyed  for  ease  of  reference  to  their 
continuous  counterparts  in  Sections  II  and  III. 

The  system  to  be  controlled  and  the  performance  index  are 

x(k+l)-A(k)x(k)+S(k)u(k);  x  (*«)-*,  (A2.la) 


r/4k)-C(k)x(k) 


(A2.lb) 


/(*s.*o.IW-))-  2  {!»(*) ~.>’r(*)l<K*)+l,,(*)lju»>} 

m 

(A2J) 

and  the  notation  (AT(*X  H(k,i))  rep  me  cuts  the  linear  control  policy 


«(*.jr(*),yr)--Ar(*)jr(*)+  2  H(k,i)yr(i).  (A2.2) 

/-*• 
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Fl«.  A  TIm  *aia  faactioa  ot  tfc*  npnitnl  pwaol  drivw. 


Ik  optimal  pmcMi  driver  and  it*  performance  are  given  by 

M\k,x{k),rT)m.K-'\»'Hk-¥\)Ax{k)-»'ak*\)) 

(A2.4a) 

*/-'  , 

/(*,. A,. rT, **<))-  2  |M*)-C/ -'(*)«*)!**,  (A2J) 

tan 


*(*)-*+«'/»(*+!)* 

The  penalty  paid  by  a  subop timal  driver  is 

*/->  , 

“(  ))“  2  |n(*.  )-««(*.  •)!**,.  (A2.8) 


Hk)-AlHk+ 1  )-F(*+ 1)  a* -lB'W*  l)l^+C'fiC; 

P{kf)-C'QC  (A2.4b) 

•«  (k,)-CQ/r  (A2.4C) 


All  the  above  relation*  are  valid  for  time-varying  system  parameter*, 
even  though  for  brevity  the  time  argument  ha*  sometimes  been  omitted. 

The  spectral  analysis  of  Section  III  may  be  carried  out  using  the 
discrete  Fourier  transform 

I  f  40 

X(k)mJ-(  X{t+‘)**id»~X(**‘)-  2  X(k)t -**. 

(A3 .3) 


ms  TRANSACTIONS  ON  AUTOMATIC  CONTROL,  VOL.  AC-25.  NO.  S,  OCTORBR  1980 


rig.  T.  Tm  gMa  hMin  of  MUNr'i  <trtv«r. 


With  tbs  obvious  modification*.  Theorem  1  is  valid  for  discrete  tm:. 

Tbs  time-invariant  optimal  prescient  driver  «®(-)“(*#.  H(k-i)'  its 
lain,  and  its  performance  following  the  trajectory  generator  are 


>„-A 


(A3 .6c) 
(A3 .8c) 


(A3  M) 


H*{k-i)-l*-'*W(k-‘}-'C-Q  k-i<  0  (A3fb) 

lo  *-t>0 

r*(^-)-|7-  C/,7  •(.  ->•/-  A']-  'C'fie  (A3  Ja) 

A  rr>  "•<■)) -*t[  fn  -raAxAi] *r  (A3.Sb) 


aad  A„  B,,  and  C,  are  as  defined  in  (3.7b). 

MiUer*s  driver  for  preview  length  JC,  u*(-)«(/t°,  HM{k-i)),  its  gem, 
aad  its  penalty  following  the  trajectory  generator  are 


H°(k-i)  i*k  +  K 
0  i>k+K 


(A3. 10) 


-At  "£  ]  *  '.-A*.*'' C’QC. 


(A3 ,7a) 


r*(  »>*)  -  f  °( *'" )  +  j|*  ' 1  * '( A') *(  r  *"*/ -  A)  “  'CQ\  * 

(A3.Ua) 

<U(>V.  «"<»-*r[  ^nA*A,(A  ')*>,,  +  4]^  (A3Ub) 

Ml 

A*)  ,  (A3. lie) 
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Tomizuks's  driver,  its  gsin  when  Cr-/,  end  its  penslty  when  xr(t+T ) 

is  measurable,  are 

uT(-)~(K°,  H*(k-i))  +  R-xB'(\‘)KPxlATxT(k  +  K)  (A3.12) 

fr(e>“)- !••(*■«)  + J* A')*[  (e ->*/- A)" 'C(?- /*„^r]|4 

(A3. 13a) 

W  xr,  uT(  • ))  -  B'Trn  \KP\ ,(  A)*?,,  Bt.  (A3. 13b) 
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ASSTIACT 

The  perception  of  the  fault  directory 
approach  to  analog  circuit  dlagttoela  le  introduced 
by  a  case  study t  the  diagnosis  of  s  switching 
voltage  regulator* 

INTRODUCTION 

Fault  Diagnosis  of  analog  circuits  has  bacons 
an  active  research  ares  recently  (1-9).  A  good 
survey  of  this  subject  can  be  found  In  (10,11). 

Moet  of  the  nathods  (1-9)  considered  the  circuits 
as  Interconnections  of  discrete  coaponents  even 
though  there  are  clearly-defined  functional  blocks 
In  the  ~lrcult.  This  Is  because  these  aathoda  de¬ 
pend  on  the  aseuaptlon  that  each  coaponent  can  be 
characterised  by  soaa  paranetera.  This  aaauaptlon 
asy  not  be  valid  for  functional  blocks.  Also 
these  aethoda  require  neaaureaenta  be  taken  In  a 
purely  DC  or  purely  AC  teat.  This  restriction  aay 
haaper  th~  diagnosis  capability  to  certain  cir¬ 
cuits,  o.g.,  a  switching  circuit. 

Taking  the  above  probleaa  Into  account,  a  new 
approech,  the  fault  directory  approach,  which  Is 
conceptually  different  froa  the  previous  aathode, 
will  be  presented.  The  first  step  of  this  aathod 
la  to  divide  a  circuit  into  functional  blocks. 

Then  a  set  of  fault  logic-equations  is  set  up 
baaed  on  the  descriptions  and  the  operating  condi¬ 
tions  of  these  blocks.  Froa  the  above  equations, 
the  fault  directory  la  established.  Note  that  the 
operating  conditions  of  each  functional  block  are 
taken  Into  consideration  In  the  fault  directory 
net hod.  A  functional  block  can  work  properly  only 
when  Its  operating  conditions  are  not.  Therefore, 
any  practical  fault  diagnosis  aathod  should  take 
operating  conditions  Into  consideration. 

To  show  the  versatility  of  this  aathod,  we 
will  present  a  case  study  on  a  practical  switching 
voltage  regulator.  Thla  particular  circuit  la 
chosen  because  It  la  a  realistic  circuit  and  It 
has  features  which  are  not  considered  by  previous¬ 
ly  aeatloned  nathods i 

(1)  It  Is  a  hybrid  circuit,  consisting  of 
both  analog  and  digital  subelrcults. 

(2)  It  contains  nonlinear  elenents  whose 
nominal  characteristics  aay  not  be  pre¬ 
cisely  described. 

(3)  It  has  switching  subcircuits* 

Mote  that  this  circuit  cannot  be  fault  diagnosed  by 

•This  paper  was  supported  in  part  by  the  Office  of 
Naval  Research  under  Grant  N001A-78-C-04AA. 
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the  nathods  based  on  a  purely  DC  or  purely  AC  test. 

In  the  following  section,  each  functional 
block  of  the  circuit  will  be  described  In  detail. 

The  perception  of  the  fault  directory  approach  will 
be  Introduced  by  the  diagnosis  of  this  circuit. 

A  CASE  STUDY 

In  this  section  we  want  to  show,  step  by  step, 
how  a  fault  directory  can  be  established  to  Isolate 
faulty  functional  blocks. 

1.  The  Switching  Voltage  Regulator  Circuit 

The  circuit  under  study  Is  a  switching  voltage 
regulator  as  shown  In  Fig.  1.  In  the  circuit,  the 
node-voltages  1-6  are  neasureable.  Vj  Is  the  Input 
voltage,  V0  the  output  voltage,  Vg  the  reference 
voltage  and  Vq  the  control  voltage.  The  purpose  of 
this  circuit  Is  to  regulate  the  output  voltage  VQ  so 
that  It  Is  close  to  the  reference  voltage  Vg. 

This  Is  a  slightly  aodlfled  circuit  froa  an 
actual  circuit  design.  Note  that  It  contains  feed¬ 
back  path,  logic  circuits,  switching  circuits,  and  a 
transformer.  The  purpose  of  this  section  Is  not  to 
develop  a  diagnosis  algorithm  for  this  circuit,  but 
to  deaonstrate  the  applicability  of  our  approach. 

The  nethod  presented  here  can  be  applied  to  the 
original  circuit,  and  will  lead  to  the  sane  answer. 
But  sore  explanation  would  be  needed. 

2.  Functional  Hocks  and  Operating  Conditions 

In  the  first  step,  the  circuit  la  decomposed  In¬ 
to  four  functional  blocks,  as  shown  by  the  dashed 
11ns  In  Fig.  1.  The  result  la  a  functional  block 
circuit  shown  in  Fig.  2.  Those  functional  blocks 
are  chosen  because  of  their  specific  functions  In 
the  circuit  which  will  be  described  in  the  following 
paragraphs. 

Before  specifying  the  functions  of  the  func¬ 
tional  blocks,  soaa  notations  have  to  be  Introduced. 


Notations 

1.  V-  denotes  a  positive  voltage  significantly 
greater  than  sero. 

2.  V  denotes  a  negative  voltage  significantly 
aasller  than  sero. 

3.  V°  denotes  a  voltage  very  cloea  to  sero. 

4.  I+,  I**  and  1°  are  slallarly  specified. 

5.  t  denotes  an  increase  in  voltage  or  current. 

6.  4  de  otee  e  decrease  In  voltage  or  current. 
With  the  above  notations,  we  will  specify  the 
functions  of  the  functional  blocks  Bl,  B2,  B3,  B4 
as  well  as  L,  Rt,  and  R2. 
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1.  Operating  Condition* 


a)  0  <  V6  <  Vt  at  all  tinea. 

b)  V0  la  not  atuck  at  a  fixed  voltage  by 
the  load. 

2.  The  function  Table 


If  all  the  operating  condition*  are  aatlafled 
and  all  functional  block*  are  connected  aa  shown  In 
fig.  2.  the  sequence  of  event*  are  given  aa  follow* 

1>  VC  „  VC° 


Vo 

Vl 

12 

v3  i3 

u 

V5 

Vl 

h  v0 

<VR 

Vl+ 

h* 

v3°  I3* 
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♦  ♦ 

Concluding  fron  the  first  and  last  colunn,  If  V0  < 
Vg,  then  V0  lncreaaea  and  If  VQ  >  Vg,  then  ,0  de¬ 
creases.  Therefore,  VQ  will  be  regulated  at  Vg. 

2)  Vg  -  VC+ 

In  This  case.  It  la  easy  to  see  from  the  func¬ 
tions  of  B2  that  V0  will  decrease  to  sero  and  the 
circuit  no  longer  works  like  a  voltage  regulator. 

Note  that  for  each  functional  block  we  have 
specified  operating  conditions  as  well  as  a  func¬ 
tional  table.  This  steus  frou  the  need  of  specify¬ 
ing  operating  conditions  for  practlcle  functional 
blocks.  Consideration  of  the  operating  conditions 
Is  essential  In  the  fault  diagnosis,  since  a  devia¬ 
tion  frou  the  functional  table  nay  be  caused  by 
either  the  failure  of  the  functional  block  Itself 
or  by  a  change  of  the  operating  conditions. 

3.  fault  Logic-Equations 

the  fault  logic-equation  for  each  block  is 
baaed  on  the  aaaiaptlon  that  the  functional  block 
la  fault-free  If  and  only  If  Its  operation  condi¬ 
tions  are  aatlafled  and  It  functions  as  described 
by  Its  funclonal  tabla.  The  set-up  of  these  fault 
logic-equations  will  now  be  Illustrated. 

Note  first  that  the  failure  of  operating  con¬ 
dition*  nay  be  caused  by  a  fault  of  other  function¬ 
al  blocks.  Such  cases  are  listed  In  the  following 
observations  > 


Observation* 
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2. 


If  81  is  not  shorted  end  v2”v2° •  then  Vj  cannot 
be  stuck  by  load. 

If  R2  1*  neither  opened  nor  aborted  and  Vg-Vg* , 
then  V3  cannot  be  stuck  by  the  load. 

3.  If  0<Vg<Vg  then  L  is  neither  opened  or  shorted. 

4.  If  L  Is  not  shorted  then  Vj  cannot  be  stuck  by 
the  load. 

These  observations  will  be  used  to  set  up  fault 
loglc-equat ions . 

The  other  operating  conditions  will  not  be  put 
into  the  fault  loglc^quatlonss 
A)  V2  -  V2® 

»)  Vg  •  Vg+ 

C)  0  <  V6  <  Vt 


0)  V0  la  not  stuck  by  tha  load. 

The  first  step  of  fault  diagnosis  Is  to  verify  the 
above  conditions.  If  they  are  satisfied,  then  the 
fault  logic-mquatlona  can  be  set  up  as  follows: 

(El)  f(Bl)  -  f (Bl)  x  f(Rl  not  shorted) 

(E2)  f(B2)  -  f(B2)  x  f(Rl)  x  f(R2) 

(E3)  ?(B3)  -  f(B3)  x  f (R2) 

(E4)  f(B4)  -  f(B4)  x  f(L) 

where  x  Is  a  logical  AND  operator,  and  F  and  f  have 
the  fo' lowing  eeanlng : 

F(bl)  has  value  1  If  Bl  Is  estimated  to  be 
fault-free, 

0  If  Bl  la  estimated  to  be 
faulty, 

f (Bl)  haa  value  1  If  Bl  Is  actually  fault-free, 
0  If  Bl  Is  scutally  faulty. 

The  others  sre  similarly  defined.  These  equations 
described  the  relationship  from  the  actual  fault 
status  of  the  subcircuits  and  the  status  of  the 
subcircuits  concerning  with  the  operating  condition 
to  the  estimated  status  of  the  subclrculta.  Take 
(El)  for  Instance;  It  meane  that  If  Rl  la  not 
shorted  and  Bl  Is  fault-free,  then  the  test  of  Bl 
will  conclude  fault-free.  The  reason  la  as  fol¬ 
lows.  If  Rl  Is  not  shorted  and  by  previous 

test,  then  V;  Is  not  stuck  by  the  load,  l.e.,  the 
operating  condition  of  Bl  Is  satisfied.  The  test 
of  Bl  by  Its  function  table,  therefore,  will  yield 
the  true  state  of  fault  of  Bl.  Note  that  If  Rl  la 
shorted,  a  teat  of  Bl  will  yield  Bl  at  fault  even 
though  It  la  fault-free. 

A.  A  Fault  Directory 

Tram  these  fault  logic-equations,  tha  fault 
truth-value  table  (Table  1)  can  be  easily  obtained 
for  the  case  that  at  most  one  of  functional  block 
la  at  fault.  As  can  be  aeen  In  Table  1,  there  Is 
a  pattern  of  estimated  status  of  Bl,  B2,  B3,  and 
B4  correapondlng  to  each  fault  case.  By  reorgan¬ 
ising  Table  1  In  terma  of  theae  patterns,  we  have 
the  fault  directory  (Table  2)  which  Indicates  the 
poselble  fault  location  once  the  pattern  Is  ob¬ 
tained  from  measurements  and  simple  estimations. 

From  Table  2,  It  can  be  aeen  that  for  the 
atngle  fault  caae,  the  fault  can  be  Isolated  Into 
one  of  four  groups:  Bl,  (B2,R1),  B3,  snd  (B4,L). 
Once  s  group  Is  Isolated  to  be  at  fault,  tha  fault 
directory  approach  can  then  be  applied  to  this 
group.  The  process  can  be  repeated  so  that  s 
hierarchical  ayatem  can  be  set  up  for  diagnosis 
purposes.  This  concludes  the  demonstration  of  the 
fault  directory  approach. 


3.  Conclusion 

The  fault  directory  approach  for  analog  cir¬ 
cuits  Is  similar  to  the  fault  dictionary  approach 
for  digital  circuits.  It  requires  minimum  comput¬ 
ing  for  users.  This  Is  the  major  advantage  of  this 
method. 

It  has  demonstrated  that  It  has  the  capability 
to  diagnose  circuits  which  consist  of  nonlinear 
components,  switching  circuits  and  analog-digital 
hybrid  circuits. 

Functional  blocks,  rather  circuit  components 
or  elements,  are  used  as  the  baels  for  our  method. 
This  seems  to  be  a  natural  approach  for  fault 
dlagnoala. 
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Fault  Diagnosis  in  Electronic  Circuits 


During  the  past  quarter  century  the  engineering 
community  has  been  witness  to  tremendous  strides 
in  the  art  of  electronics  design.  The  graphical 
algorithms  of  the  previous  generation  have  given 
way  to  the  modem  CAD  package,  the  breadboard 
has  been  subsumed  by  the  simulator.  Indeed,  even 
the  universal  building  block  has  become  a  reality. 
To  the  contrary  electronics  maintenance  has  changed 
little  since  the  day  of  the  vacuum  tube,  remaining 
the  responsibility  of  the  experienced  technician 
with  scope  and  multimeter.  As  such,  our  ability 
to  design  a  complex  electronic  circuit  is  quickly 
out-distancing  our  ability  to  maintain  it.  In  turn, 
the  price  reductions  which  have  accompanied 
modem  electronics  technology  have  been  paralleled 
by  increasing  maintenance  and  operations  costs. 
Indeed,  many  industries  are  finding  that  the  life 
cycle  maintenance  costs  for  their  electronic  equip¬ 
ment  now  exceed  their  original  capitol  investment. 

Given  the  above,  it  is  quickly  becoming  apparent 
that  the  electronics  maintenance  process,  like  the 
design  process,  must  be  automated.  Unfortunately, 
the  50  years  of  progress  in  circuit  theory,  on 
which  our  electronics  design  automation  has  been 
predicated,  does  not  exist  in  the  maintenance 
area.  As  such,  the  past  decade  has  witnessed  the 
inauguration  of  a  basic  research  program  to  lay  the 
foundations  for  a  theory  of  electronics  maintenance 
and  a  parallel  effort  to  develop  operational  elec¬ 
tronic  maintenance  codes. 

Thus  far  the  greatest  success  has  been  achieved 
in  the  digital  electronics  area,  wherein  the  finite 
state  nature  of  the  UUT  (unit  under  test)  may  be 
exploited*’.  Typically,  one  assumes  that  all  fail- 

*  Texas  Tech  University 

•*  University  of  Notre  Dame 


R.  Saeks*  and  R.  Liu** 

ures  manifest  themselves  in  the  form  of  com¬ 
ponent  outputs  which  are  either  “stuck-at-one”  or 
“stuck-at-zero”  and/or  shorts  and  opens51.  Under 
such  an  assumption  a  theory  for  digital  system 
maintenance  has  been  developed  and  practical 
fault  diagnosis  algorithms  are  in  the  formative 
stages  of  development.  Typically,  one  hypothesizes 
some  limit  on  the  number  of  simultaneous  faults 
and  then  simulates  the  responses  of  the  UUT  to  a 
family  of  test  vectors  for  each  allowed  combination 
of  faults.  The  actual  responses  of  the  UUT  are  then 
compared  with  the  simulated  responses  to  locate 
the  failure.  Although  lacking  in  asthetic  appeal 
the  above  approach,  termed  fault  simulation,  is 
ideally  suited  for  the  maintenance  environment, 
wherein,  the  actual  simulation  process  reed  only  be 
done  once  at  the  factory  or  a  maintenance  depot 
with  the  simulated  response  data  being  distributed 
via  magnetic  tape  to  the  various  field  locations 
where  the  actual  test  actual  test  is  conducted.  As 
such,  with  the  aid  of  some  sophisticated  software 
engineering,  this  apparently  “brute  force”  approach 
to  the  fault  diagnosis  problem  has  slowly  evolved 
into  a  workable  concept41.  Indeed,  at  the  present 
time  a  number  of  automatic  test  program  gener¬ 
ators  which  classify  faults,  choose  test  vectors,  and 
carry  out  the  appropriate  simulation  (often  in  a 
parallel  processing  mode),  are  commercially  availa¬ 
ble  and,  as  such,  the  automated  maintenance  of 
digital  electronic  circuits  is  becoming  a  reality41. 

Unfortunately,  the  above  described  success  in  the 
digital  world  has  not  been  paralleled  by  progress 
in  the  analog  world.  Indeed,  test  engineers  com¬ 
plain  that  while  80%  of  the  boa-ds  are  digital,  80% 
of  their  ;headaches  are  analog  and  hybrid.  This 
difficulty  arises  from  a  number  of  characteristics 
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of  the  analog  problem  which  are  not  encountered 
in  digital  circuits.  Indeed,  in  an  analog  circuit : 

(  i )  there  is  a  continuum  of  possible  failures, 

( ji )  a  component  may  be  “in  tolerance”  but  not 
nominal, 

(|)  complex  feedback  structures  are  encoun¬ 
tered, 

(jv)  simulation  is  slow  and  costly. 

( v )  poet-fault  component  characteristics  may 
not  be  known, 

(vi)  and  a  fault  in  one  component  may  induce 
an  apparent  fault  in  another 

Items  (  i )  and  (  a )  imply  that  an  extremely  large 
number  of  simulations  will  be  required  for  analog 
testing.  Items  (i)  and  (hr)  suggest  that  these 
simulations  will  be  far  more  expensive  than  similar 
digital  simulations.  Finally,  items  (v)  and  (vi) 
indicate  that  the  simulation  of  a  post-fault  circuit 
by  itself  may  not  be  a  tractable  problem.  As 
such,  it  is  by  no  means  clear  that  the  kind  of 
“brute  force"  fault  simulation  algorithm  associated 
with  the  digital  problem  will  be  applicable  to  the 
analog  or  hybrid  case. 

As  an  alternative  to  fault  simulation,  a  number 
of  academic  researchers  have  proposed  a  variety  of 
“post  test"  fault  diagnosis  algorithms,  wherein,  an 
“equation  solving  like”  algorithm  is  used  to  locate 
the  faulty  component  given  the  test  data  from  U 
UT11-”.  Although  these  algorithms  are,  in  some 
sense,  “smarter”  than  the  simulation  algorithms, 
most  of  the  required  computing  must  be  done  in 
the  field  after  the  UUT  has  been  tested.  More¬ 
over,  these  computational  requirements  must  be 
replicated  each  time  a  unit  fails.  As  such,  the 
success  of  such  “post  teat”  algorithms  is  contingent 
on  reducing  their  computational  requirements  to 
a  bare  minimum.  Although  no  system  is  yet 
operational,  with  the  aid  of  the  powerful  linear 
circuit  theory  developed  over  the  past  half  century, 
a  computationally  efficient  solution  to  the  fault 
diagnosis  problem  for  linear  analog  circuits  appears 
to  be  within  reach11-11.  Unfortunately,  no  such 
light  exists  at  the  end  of  the  nonlinear  tunnel, 
wherein  progress  appears  to  be  limited  by  a 
"computational  complexity/test  point”  bound. 

Not  suprisingly,  the  computational  cost  of  an 


analog  fault  diagnosis  algorithm  is  an  inverse  func¬ 
tion  of  the  number  of  test  points  at  which  mea-  ] 

surements  of  the  UUT  may  be  made.  Indeed,  if 
one  lets  n  be  a  measure  of  UUT  complexity  (which 
may  loosely  be  taken  to  be  the  total  number  of 
terminals  for  all  of  the  circuit  components),  then 
if  one  has  access  to  0(n)(i£1)  test  points  the  fault  3 

diagnosis  problem  can  be  resolved  using  linear 
algorithms71-101.  Moreover,  by  combining  such  algo¬ 
rithms  with  the  above  mentioned  linear  algorithms, 
acceptable  computational  efficiency  can  be  obtained 
with  0(m)  test  points  where  m  is  a  measure  of  the 
complexity  of  the  “nonlinear  subsystem”  of  the 
UUT*1-71.  Although  such  algorithms  can  be  effec¬ 
tive  on  the  typical  academic  example  a  “real  world” 

PC  (printed  circuit)  board  does  not  have  terminal 
space  for  the  20  or  30  test  points  which  are  i 

required  even  for  a  routine  board  made  up  of 
discrete  components  and/or  SSI  (Small  Scale  Inte¬ 
gration).  Although  the  problem  can  be  partially 
alleviated  by  making  internal  measurements  with 
the  aid  of  a  “bed-of-nails”  tester  it  has  been  our 
experience  that  such  testers  cause  as  many  failures  j 

as  they  locate  while  their  applicability  to  two- 
sided,  multilayer,  and  coated  boards  is  severely 
limited.  As  such,  we  would  like  to  limit  the 
number  of  test  points  to  the  terminal  space 
available  at  the  edge  of  a  PC  board.  On  the  other 
hand,  the  UUT  complexity,  n,  increases  with  the 
area  of  the  board.  As  such,  the  number  of  test 
points  required  by  an  analog  fault  diagnosis  algo¬ 
rithm  should  increase  at  a  rate  of  no  greater  than 
0(n*(J).  A  further  study  of  the  possible  tradeoff 
between  test  points  and  computational  cost  appears 
in  references  11)  and  12). 

Unfortunately,  all  computationally  acceptable 
“post  test”  algorithms  which  have  thus  far  been 
proposed  have  test  point  requirements  which  grow 
linearly  with  UUT  complexity  (assuming  that  m 
grows  linearly  with  n).  As  such,  many  researchers 
are  looking  at  the  classical  fault  simulation  algori¬ 
thms  with  renewed  vigor.  Indeed,  these  algorithms 
have  minimal  on-line  computational  costs,  while 
the  number  of  test  points  employed,  can  easily  be 

(it  1)  f(n)— 0(n)  means  f  increases  in  the  order  of 
n;  more  precisely,  |f(n)|£c|n|  for  some  c>0. 
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kept  below  0(n2h).  The  difficulty  lies  with  the 
required  number  of  simulations  and  the  develop¬ 
ment  of  decision  algorithms  which  will  allow  us  to 
“interpolate”  between  simulated  data  points. 

Thus,  while  the  state-of-the-art  in  digital  diagn. 
nosis  is  fast  maturing,  a  serious  investigation  of 
analog  fault  diagnosis  problems  is  only  just  begin¬ 
ning.  Indeed,  a  satisfactory  fault  diagnosis  code  for 
linear  analog  circuits  has  yet  to  be  demonstrated 
while  the  nonlinear  problem  has  yet  to  progress 
beyond  the  basic  research  stage. 
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Topological  Conditions  for 
Single-Branch-F  ault 

C.  S.  LIN.  Z.  F.  HUANG,  and  R.  LIU 


Abstract  —The  testability  condition  for  single  branch-fault  of  an  analog 
network  is  given.  Then  several  necessary  and  sufficient  conditions  for  this 
testability  condition  are  provided  and  their  proofs  are  shown  in  detail. 
These  conditions  have  great  applications  in  the  testability  design  as  demon¬ 
strated  in  the  examples. 

I.  Introduction 

LET  N  (Fig.  1)  be  a  connected,  b-branch.  (n  +  l)-node, 
linear  time-invariant  network.  Out  of  the  (n  +  1)  nodes, 
(m  +  1)  are  accessible  terminals  for  both  excitation  and 
measurement.  Let  one  of  the  accessible  nodes  be  the  refer¬ 
ence  node  of  the  measurements  denoted  by  rt0.  The  follow¬ 
ing  notations  are  used  for  N. 

vm  terminal  (accessible  node)  voltage  vector  with  re¬ 
spect  to  n0, 

im  terminal  current  vector, 
vb  branch  voltage  vector, 
ib  branch  current  vector, 
y  branch-admittance  matrix, 

M  set  of  all  accessible  nodes  including  nQ. 

Note  that  vm  and  im  do  not  contain  the  entries  of  n0  and 

each  contains  m  entries.  Also  note  that  ih  =  Tufc  and  Y 

needs  not  to  be  symmetrical.  Then  in  N  together  with  the 
m  measurement  branches  from  (M  -{«0»  to  n0,  we  have 
the  following  KVL  and  KCL  equations: 

“  Bmvm  (la) 

Qh'b  ”  ~  Qjm  (lb) 

where  B' s  and  Q's  are  the  submatrices  of  the  loop  and 
cut-set  matrices,  respectively. 

Since  N  is  connected,  we  can  choose  a  tree  in  N.  For  a 
given  tree  T,  denote  the  branch  voltage  vector  and  branch 
current  vector  of  those  in  T (its  cotree  in  N)  by  u,(uj  and 
i,(if),  respectively.  Label  the  branches  in  T  ahead  of  those 
in  the  cotree  of  T.  Then  vh  *  [vj"  :  vf]T  and  (1)  can  be 
written  as  follows: 


+  ;  u, 
0.  u. 
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Fig.  1 .  Network  N. 


[/:  +f]  !  -[-£]/„ 
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where  /  is  an  identity  matrix  and  E  and  F  are  submatrices 
of  Qb  and  Qm,  respectively,  w.r.t.  the  tree  T. 

Now  consider  that  the  network  N  is  perturbed  to  (N  + 
AN)  in  the  way  that  Y  is  perturbed  to  Y  +  AY,  and  the 
graph  remains  the  same.  In  the  perturbed  network,  we 
denote  the  corresponding  voltages,  currents,  and  admit¬ 
tance  by  vm  +  Avm,  im  +  Aim,  v„  +  Av„,  and  ib  +  A i„,  re¬ 
spectively. 

Given  Y  and  the  graph,  the  purpose  of  the  fault  diagno¬ 
sis  is  to  estimate  AY  from  the  information  im,  Avm,  and 
Aim.  Note  that  vm  can  be  calculated  from  im  and  N. 
Without  loss  of  generality,  we  may  assume  that 


i.e.,  apply  the  same  im  to  the  original  network  N  and  the 
perturbed  network  N  +  AN. 

Under  this  condition,  it  can  be  shown  from  (2)  that  in  N 


+  ET(HYHTy'Ein 


and  in  N+  AN 


vm  +  Aum  -  +  Et(  HYHt)  ~ 1  [ Eim  -  HAY{ vb  +  Ac,)] 


where 


//*[/  :  F], 

Subtracting  (3)  from  (4) 

Aum  -  -  Et( HYHt)~'HAY{ vb  +  Auft) 


C.  S.  Lin  was  with  the  University  of  Notre  Dame.  Notre  Dame,  IN.  He 
is  now  with  the  Department  of  Electrical  Engineering,  National  Tsing 
Hua  University.  Hsinchu.  Taiwan.  Republic  of  China. 

Z.  F.  Huang  and  R.-W.  Liu  are  with  the  Department  of  Electrical 
Engineering,  university  of  Notre  Dame,  Notre  Dame,  IN  46336. 


where 


b  Jb 


zmb*  +  et(hyht)-'h 
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and 

jb±-lY(vb  +  ±vh).  (6) 

Define  that  a  branch  is  at  fault  if  and  only  if  the 
corresponding  row  of  AX  is  nonzero.  Then,  in  (6),  the  k  th 
entry  of  jb  is  nonzero  only  if  the  /c  th  row  of  Ay  is  nonzero 
or,  equivalently,  the  A:  th  branch  is  at  fault.  And  the  con¬ 
verse  is  true  for  almost  all  practical  cases.  Taking  ad¬ 
vantage  of  this  property  of  jb,  (5)  suggests  that  one  can 
determine  the  location  of  nonzero  elements  of  Ay  and 
therefore  the  faulty  branches  provided  that  the  number  of 
accessible  nodes  is  large  enough.  This  discovery  was  made 
by  Biemacki  and  Bandler  [1],  [2]  and  Sakla  et  al.  [3].  The 
advantages  of  this  approach  are  that:  1)  one  can  rely  only 
on  the  parameters  of  N ,  i.e.,  the  nominal  network,  and  the 
measurements  to  diagnose  the  faulty  network  and  2)  the 
computation  involved  are  all  linear. 

However,  despite  all  these  advantages,  systematical  stud¬ 
ies  on  materializing  this  approach  to  locate  the  faulty 
branches  have  been  lacking.  In  the  next  section,  as  a 
demonstration,  the  diagnosis  of  the  single  branch-fault  will 
be  discussed  in  detail  and,  more  importantly,  the  graphical 
conditions  for  placing  the  test  points  will  also  be  stated 
and  proved. 

II.  Single-Fault  Testability 

In  this  section,  we  will  study  the  diagnosis  of  single 
branch-fault,  or  abbreviate^  single  fault,  by  using  (5).  For 
the  simplicity  of  demonstration,  only  the  networks  without 
dependent  sources  are  considered  in  this  paper.  Simple 
conditions  on  testability  will  be  obtained  by  taking  ad¬ 
vantage  of  a  property  that  network  without  dependent 
sources  has  diagonal  branch-admittance  matrix.  Similar 
results  can  also  be  obtained  for  the  general  cases. 

To  study  the  diagnosis  of  networks,  we  will  characterize 
a  network  by  its  capability  to  be  diagnosed  by  a  single  test 
vector  im.  This  is  given  in  the  following  definition. 

Definition  1:  A  network  N  is  said  to  be  single-fault 
testable  if  when  N  is  perturbed  to  N  +  A N,  by  choosing 
one  appropriate  test  vector  im,  one  will  be  able  to  de¬ 
termine  from  the  measurements  on  accessible  nodes  M: 

(a)  whether  or  not  N  +  A  N  has  more  than  one  branch 
fault, 

(b)  if  negative,  the  faulty  branch  can  be  uniquely  located. 

To  locate  single  fault  by  using  (5),  each  column-vector  of 

Zmb  is  compared  to  Aom  for  consistency  and  the  branches 
whose  corresponding  column- vectors  are  in  parallel  with 
Acm  are  the  potential  candidates  of  the  fault.  In  order  to 
achieve  single-fault  testability,  it  is  essential  to  have  any 
two  column- vectors  of  Zmb  linearly  independent.  Let  us 
define  this  condition  as  single-fault  testability  condition. 
This  is  a  necessary  condition  for  N  to  be  single-fault 
testable.  In  fact,  it  is  also  sufficient  as  shown  in  the 
following  theorem. 

Theorem  /.  N  is  single-fault  testable  if  the  single-fault 
testability  condition  is  satisfied.  Conversely,  if  a  single  test 
vector  is  used  then  it  is  also  necessary. 


The  proof  of  the  above  theorem  can  be  found  in  [4], 

Note  that  Theorem  1  translates  an  abstract  definition  of 
single-fault  testability  to  a  familiar  property  of  a  matrix 
Zmh  with  the  explicit  expression  which  can  be  further 
studied.  An  initial  step  is  to  investigate  the  implication  of 
two  column-vectors  of  Zmh  to  be  linearly  independent. 
However,  not  to  restrict  ourselves  to  only  two  column-vec¬ 
tors.  the  linear  independence  of  k  column-vectors  of  Z„b  is 
studied  and  the  result  is  shown  in  Theorem  2. 

Theorem  2:  Let  Q  be  a  set  of  k  branches  in  N  and  Zmq 
be  the  m  x  k  submatrix  of  Zmb  whose  columns  correspond 
to  branches  in  Q.  Then  Zmq  has  full  column  rank,  only  if 
there  exists  a  tree  T  in  N  such  that 

( 1 )  T  contains  Q, 

(2)  (T -  Q)  is  connected  when  all  accessible  nodes  in  N 
are  shorted  together. 

The  converse  is  also  true  for  almost  all  Y. 

Proof:  (The  necessary  part):  As  a  preliminary  step,  we 
want  to  show  that  Q  cannot  contain  loops.  Let  H  be  the 
nxk  submatrix  of  H  whose  columns  correspond  to  the 
branches  of  Q.  Then  Zmq  =  ET( HYHT)~  'H.  Evidently. 
Zmq  has  full  column-rank  only  if  H  has  full  column-rank. 
Since  H  is  a  submatrix  of  a  cut-set  matrix  with  the  same 
number  of  rows,  H  has  full  column-rank  if  and  only  if  Q 
contains  no  loop  [10].  Consequently,  the  branches  of  Q 
with  Zmq  full  column-rank  cannot  contain  any  loop. 

Since  Q  contains  no  loop,  we  can  choose  a  tree  T  in  N 
such  that  T  contains  Q.  Then  label  the  branches  in  the 
following  order:  branches  in  Q,  branches  in  (T -Q),  and 
the  rest.  Corresponding  to  this  labeling,  the  matrix  H  has 
the  following  from: 

’/*  1  0  1  £,]  A  [//,' 

-j - i  -  -  -  -1  (7) 

0  I  hn-k)  I  ^1  ,  . 

where  /*  is  an  k  x  k  identity  matrix.  £,  and  F2  are  matrices 
of  0,  1,  and  -  1,  with  dimensions  kx(b-  n)  and  (n  -  k) 
x(b  -  n),  respectively,  and 

4  [h  :  0  :  £,]  H2  *  [o  F2] 

and  Zmq  can  be  written  as 


ET(HYHT)' 


From  a  well-known  formula  in  [7],  it  can  be  shown  that 
Zmg  has  full_column-rank  if  and  only  if  there  exists  a  kxn 
submatrix  £  of  £  such  that  the  matrix 


UVljT  I 

W  A  HYH  ,  ---  (9) 

*  0 

is  nonsingular.  Expanding  the  determinant  of  W  by  k  x  k 
blocks  of  £ r,  we  have 


det  IF  =  £  det (E7),- det (  H2 YH ) , 
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where  ( £ T),  is  a  mxm  submatrix  of  £r,  ( H2YHT ),  is  a 
(n  -  k)X(n  -  k)  submatrix  of  ( H2YHT )  associated  with 
( £ T),  in  expansion,  and  the  summation  is  over  all  ( m  x  m ) 
submatrices  of  £ T.  Moreover,  any  term  det( HzYHr)l  in 
(10)  can  be  further  expanded  by  Binet-Cauchy  Formula 
[6]  into 

del( H2YHT ) '  =  £  det(f/;)y-det(y);-det((//r),)y 

J 

(11) 

where 

( Y)j  is  a  (n  -  k)X(n  -  k)  principal  submatrix  of  Y 
and  is  diagonal, 

(HT),  is  the  b  x(n  -  k)  submatrix  of  Hr  whose  col¬ 
umns  correspond  to  those  of  ( H->  YH T ),  in 

m 

((HT)i)]  is  the  (n-k)X(n-k)  submatrix  of  (HT), 
whose  rows  correspond  to  the  branches  in 
< Y), , 

( H2)j  is  the  (n  -  k)x(n  -  k )  submatrix  of  H2  whose 
columns  correspond  to  the  branches  in  ( Y  )y , 
and  the  summation  is  over  all  the  (n  -  k ) x 
(n-  k)  principal  submatrices  of  Y. 

Note  that  in  the  above  expansion,  the  assumption  of  Y 
being  diagonal  has  been  used. 

From  (10)  and  (1 1),  it  can  be  seen  that  W  is  nonsingular 
only  if  there  exist  a  pair  of  (/.  j)  such  that  det(£r)(. 
det(T)y,  det(( HT),)r  and  det (HZ)J  are  all  nonzero.  Since 
in  this  theorem  only  the  connection  of  the  branches  is 
concerned,  we  will  examine  only  the  conditions  of 
detff7),  *  0,  dei((Hr),)j  =*  0,  and  det(  Hz)/  *  0. 

First,  consider  the  implication  of  det( Hz)t  *  0.  Let  B  be 
the  set  of  branches  corresponding  to  the  columns  of  (Hz)r 
Then  from  the  definition  of  H2  in  (7),  it  can  be  seen  that 
BnQ  =  0  and  det( H2 )y  *  0  if  and  only  if  the  column- vec¬ 
tors  of  H  corresponding  to  the  branches  of  B*JQ  are 
linearly  independent.  Since  H  is  a  cut-set  matrix,  the  latter 
condition  is  equivalent  to  that  B  U  Q  contains  no  loop  in  N 
[10],  From  this  condition  together  with  the  fact  that  there 
are  n  branches  in  B  u  Q,  we  can  conclude  that  B'JQ  must 
be  a  tree  T  in  /V. 

Next,  consider  the  implication  of  det( ET){ =*  0  and 
det((//r),)y  *0  for  some  i.  First,  note  that  the  branches 
corresponding  to  the  rows  of  ((HT),)J  are  exactly  those  in 
B  which  together  with  Q  constitute  a  tree  T  in  N.  Let  H} 
be  the  nx(n  -  k)  submatrix  of  H  whose  columns  corre¬ 
spond  to  those  branches  in  B.  Without  loss  of  generality, 
we  may  assume  that  T  =  T  and,  therefore.  =  [0  : 
/(„_*,].  Then  we  let  N'  be  a  graph  constructed  from  N  by 
adding  m  branches  from  the  reference  node  n0  to  all  the 
rest  of  accessible  nodes.  Denote  the  set  of  these  m  branches 
by  B' .  Then  £  in  (2)  is  the  submatrix  of  the  cut-set  matrix 
of  N'  whose  columns  correspond  to  the  m  branches  in  B'. 
Let  B"  be  the  subset  of  B'  whose  k  branches  correspond  to 
the  columns  of  £.  Now.  from  ( 10),  we  can  see  that  det(  £ ), 
*0  and  det((f/r),)y  »0  for  some  i  if  and  only  if  the 


matnx 

o; 

has  full  row-rank,  or,  equivalently,  the  matrix  ( W3  :  £)  has 
full  column-rank.  The  latter  implies  B  u  B"  contains  no 
loop.  Furthermore,  since  B  U  B"  contains  n  branches,  it 
can  be  concluded  that  B  u  B"  forms  a  tree  in  N' .  In  other 
words,  T'—Q  +  B"  is  a  connected  graph.  Since  B"  e  B'. 
T'-Q+  B'  is  also  connected.  Thus  T'-Q  is  connected 
when  branches  in  B'  are  shorted  or.  equivalently  all  the 
accessible  nodes  are  shorted  together. 

Therefore,  we  have  shown  that  Zmg  having  full  column- 
rank  implies  the  existence  of  a  tree  7"  which  satisfies  the 
conditions  in  Theorem  2.  This  proves  the  necessary  part. 

( The  sufficient  party.  Suppose  that  there  exists  a  tree  T 
in  N  satisfying  the  conditions  of  Theorem  2.  We  will  prove 
that  Zmq  has  full  column-rank  for  almost  all  Y.  In  the  First 
step,  we  want  to  show  that  this  is  true  for  a  particular  K. 
We  choose  a  specific  set  of  branch  admittances  in  the 
following  way:  first,  set  all  admittances  to  1  for  those 
branches  in  (T -  Q),  then  set  the  rest  to  zero.  We  further 
label  the  branches  in  the  same  way  as  in  the  necessary  part. 
Then  it  can  be  shown 

( H2YHr)  =  [0  : 

Note  that  [0  :  is  also  the  submatrix  of  HT  whose 

rows  correspond  to  the  branches  in  T-  Q.  From  Condition 
(2)  of  Theorem  2.  (T-Q)  is  connected  when  nodes  in  M 
are  shorted.  That  is  equivalent  to  say  that  (T -  Q  +  B')  is 
connected  and  contains  all  nodes  in  N'  from  the  discussion 
in  the  necessary  part.  We  claim  that  there  exists  a  subset 
B”  of  B'  such  that  (T-Q- f  B")  is  a  tree  in  N’.  Suppose 
that  such  B"  exists.  Then  the  matrix 

0  1  £ 

- 1 

n  -  *  1  I 

is  nonsingular,  where  £  is  the  n  x  k  submatrix  of  £  whose 
columns  correspond  to  B"  in  V',  because  its  column-vec¬ 
tors  correspond  to  a  tree  (T-Q  +  B”).  Therefore 


H]YHr 

;_4 

w  = 

a«-*» 

1  0 
!_ 

ET 

1  0- 

is  nonsingular  and  Zmq  has  full  column-rank  for  this  Y.  To 
prove  the  claim,  note  that  (T -  Q)  has  no  loop.  Thus  each 
loop  in  (T -  Q  +  B')  must  contain  one  branch  in  B'.  Delete 
this  branch  from  (T-Q  +  B').  The  resultant  graph  is  still 
connected  and  may  contain  loops  each  of  which  contains  a 
branch  in  B‘.  Hence,  the  same  deleting  process  may  be 
repeated  until  there  are  only  n  branches  left  in  the  final 
graph  which  still  contains  t T-Q)  and  k  branches  in  B'. 
Denote  those  k  branches  by  B" .  Then  (T -  Q  +  B")  is  a 
tree  in  S'  because  (T -  Q  +  B")  is  a  connected  spanning 
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subgraph  of  N'  with  (rt  +  l)  nodes  and  ( T—Q  +  B")  con¬ 
sists  of  n  branches  [10]. 

In  the  second  and  final  step,  we  want  to  show  that  Zmq 
has  full  column-rank  for  almost  all  Y.  From  (10)  note  that 
6stW  is  a  polynomial  of  branch  admittances.  By  a  well- 
known  result  [11],  it  can  be  shown  that  det  W  is  nonzero 
for  almost  all  Y.  Therefore,  Zmq  has  full  column-rank  for 
almost  all  Y.  This  proves  the  sufficient  part.  Q.E.O. 

Note  that  the  conditions  given  in  Theorem  2  depend 
only  on  the  graph  of  N  but  not  on  the  values  of  Y.  This  is 
quite  desirable  in  testability  design  since  it  relieves  us  of 
the  consideration  of  the  values  of  Y.  However,  these  condi¬ 
tions  must  be  verified  for  every  two  branches.  It  could  be 
very  cumbersome  when  N  is  very  large.  We  will  study  the 
improvement  next. 

At  this  stage,  it  is  essential  to  introduce  some  useful 
notations  before  the  improvement  of  Theorem  2  can  be 
done.  First,  it  is  convenient  to  have  a  testing  graph  Na  of 
N.  Na  is  constructed  from  N  by  connecting  all  accessible 
nodes  to  a  new  node  a.  Then,  we  would  like  to  generalize 
the  definition  of  a  path.  A  path  is  conventionally  defined 
as  a  series  of  nodes  and  branches  incident  with  each  other 
and  contains  distinct  end-nodes.  For  our  purpose,  define  a 
single  node  as  a  null-path,  and  a  generalized-path  (abbrev. 
g-path)  is  either  a  path  or  a  null-path. 

Now,  we  are  able  to  have  the  two  lemmas  which  will  be 
used  in  the  proof  of  the  next  theorem. 

Lemma  1:  Given  N  and  its  associated  Na.  Suppose  that 
there  are  r  nodes  which  separate  Na  into  two  parts;  Parts  A 
and  B  such  that  Part  A  contains  a  and  Copart1  A  contains 
no  less  than  r  branches.  Then  the  submatrix  of  Zmh  defined 
in  (5)  corresponding  to  those  branches  in  Copart  A  has 
rank  less  than  r. 

Proof:  First,  note  that  Part  B  contains  no  accessible 
node  since  all  accessible  nodes  are  adjacent  to  node  a.  Also 
notice  that  the  rank  of  the  submatrix  of  Zm(,  is  exactly  the 
dimension  of  space  spanned  by  the  column  vectors  in  the 
submatrix.  From  (5),  this  is  equivalent  to  the  one  spanned 
by  the  vectors  Avm  generated  by  the  current  sources  each  of 
which  in  parallel  with  a  branch  is  Copart  A.  By  Thevenin 
Equivalent  Theorem,  Copart  A  and  those  current  sources 
can  be  represented  by  a  (r  -  l)-port,  as  shown  in  Fig.  2. 
Each  port  is  driven  by  a  voltage  source.  Since  there  are 
only  (r-l)-ports,  by  superposition  theorem,  the  space 
spanned  by  Aum  generated  from  these  ports  has  dimension 
no  larger  than  (r  - 1).  Therefore,  the  submatrix  of  ^ mb 
corresponding  to  the  branches  in  Copart  A  has  rank  less 
than  r.  Q.E.D. 

Lemma  2:  Given  N  and  its  associated  Nm,  the  following 
two  statements  are  equivalent; 

(a)  There  are  at  least  k  disjoint2  paths  in  Na  between  a 
and  any  inaccessible  node. 

'Copart  A  of  a  graph  G  associated  with  Part  A  is  the  subgraph  of  G 
formed  by  removing  all  the  nodes  in  Part  A  from  G. 

JTwo  paths  are  disjoint  if  they  have  no  common  node  except  the 
end-nodes.  And  they  are  absolutely  disjoint  if  they  have  no  common  node 
at  all. 


Fig.  1.  Network  of  Lemma  1. 


(b)  There  are  at  least  k  absolutely  disjoint2  g-paths  in  N 
between  M  and  any  k  nodes. 

Proof  (a)  -*  (b):  It  will  be  proved  by  contradiction. 
Suppose  that  (b)  fails  for  a  set  of  A:  nodes.  Construct  a 
graph  Nafi  from  Na  by  connecting  these  k  nodes  to  a  new 
node  /S.  Then  by  Menger’s  Theorem,  there  exists  a  set  R  of 
(k  - 1)  nodes  which  separate  NaP  into  two  parts;  Part  A 
contains  a  and  Part  B  contains  /?.  Note  that  Part  B 
contains  no  accessible  node  but  does  contain  at  least  one 
inaccessible  node.  The  latter  is  true  because  if  Part  B 
contains  no  inaccessible  node  then  /}  is  the  only  node  in 
Part  B  and  all  those  k  nodes  connected  to  ft  must  be  all 
accessible  nodes  which  automatically  satisfy  (b)  and  thus 
violate  the  assumption.  Let  v  be  such  a  node.  Then  v  is 
separated  from  a  by  the  same  R  in  both  N„  and  Nafi.  Hence 
by  Menger’s  Theorem,  there  are  at  most  (k  - 1)  disjoint 
paths  between  v  and  a  in  Na.  This  is  a  contradiction. 
Therefore,  (a)  must  imply  (b). 

(b)  -» (a):  It  will  also  be  proved  by  contradiction.  Sup¬ 
pose  that  (a)  fails  for  an  inaccessible  node  v.  Then  by 
Menger’s  Theorem,  there  exists  a  set  R  of  (k  - 1)  nodes 
which  separate  Na  into  two  parts:  Part  A  contains  a  and 
Part  B  contains  v.  For  the  k  nodes  in  R  U(u),  we  construct 
a  graph  Sa0  from  Na  by  connecting  all  nodes  in  R  U{u)  to 
a  new  node  fi.  Then  there  are  only  (k  -  1)  disjoint  paths 
between  a  and  P  in  Nafi.  Hence  in  N,  there  are  only  (k  -  1) 
absolutely  disjoint  g-paths  between  M  and  R  U(u).  This  is 
a  contradiction.  Therefore,  (b)  must  imply  (a).  Q.E.D. 

Then  we  can  have  the  following  theorem. 

Theorem  3.  Given  a  network  N  which  has  no  parallel 
branches  and  its  associated  Na,  the  following  statements 
are  equivalent. 

(a)  N  satisfies  the  single-fault  testability  condition  for 
almost  all  values  of  Y. 

(b)  There  are  at  least  three  disjoint  paths  between  a  and 
any  inaccessible  node  in  Na. 

(c)  The  local  connectivity  between  o  and  any  inaccessi¬ 
ble  node  is  at  least  3  in  Na. 

Proof  (b)  *-»  (c):  Directly  from  Menger’s  Theorem  [7], 
(a)  -*  (b):  We  will  prove  it  by  contradiction.  Let  (b) 
be  false.  Then  there  are  at  most  two  nodes  which  separate 
<VB  into  two  parts:  Part  A  contains  node  a  and  Part  B 
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contains  at  least  one  inaccessible  node.  Since  all  accessible 
nodes  are  adjacent  to  a,  they  cannot  belong  to  Part  B.  If 
Part  A  and  Part  B  are  separated  by  one  node,  then  the 
branches  in  Copart  A  has  no  effect  on  the  measurements 
on  M  and  henceforth  the  faults  in  Copart  A  cannot  be 
determined.  Now  suppose  that  there  are  two  nodes,  say  a 
and  b,  separating  Part  A  and  B.  Then  there  are  at  least  two 
branches  in  Copart  A  because  N  is  connected  and  Copart 
A  contains  at  least  three  nodes.  Therefore,  from  Lemma  1. 
the  single-fault  testability  condition  cannot  be  satisfied.  In 
either  case,  there  is  a  contradiction.  Therefore,  Statement 
(b)  must  be  true. 

(b)-»(a):  Let  bx  and  b2  be  any  two  branches  in  N. 
Branches  bx  and  b2  cannot  be  in  parallel  from  the  assump¬ 
tion,  hence  there  are  at  least  three  nodes  incident  with  bx 
and  b2.  From  Lemma  2  there  are  three  absolutely  disjoint 
g-paths  in  N  between  M  and  any  three  nodes.  Thus  there 
are  three  g-paths  between  M  and  some  three  nodes  incident 
with  bt  or  b2-  The  Statement  (a)  will  be  shown  by  con¬ 
structing  a  tree  satisfying  the  two  conditions  in  Theorem  2 
based  on  these  three  g-paths. 

First,  consider  the  case  that  bx  and  b2  have  a  common 
node.  Then  a  tree  T  can  be  constructed  by  adding  enough 
branches  to  b{,  b2,  and  the  three  g-paths.  Moreover,  T - 
{bx,  b2)  is  a  subgraph  of  three  components  each  of  which 
contains  a  g-path  and  therefore  an  accessible  node.  Thus 
T-{bx,b2)  is  still  connected  when  the  nodes  in  M  are 
shorted  together.  And  hence  T  satisfies  the  two  conditions 
of  Theorem  2. 

Next,  for  the  case  that  bx  and  b2  have  no  common  node, 
let  each  of  the  three  g-paths  contain  only  one  node  incident 
with  bx  or  b2.  This  can  be  achieved  by  shortening  a  path 
which  does  not  satisfy  this  condition.  Denote  these  g-paths 
by  Px,  P2,  and  P2.  Without  loss  of  generality,  assume  that 
b  |  joins  /*,  and  P2  and  P2  contains  one  of  the  two  nodes 
(ua,cA)  incident  with  b2  and  oc  in  M.  We  will  obtain' the 
desirable  tree  by  modifying  Px,  P2,  and  P3. 

There  are  two  sub-cases  to  be  considered  separately.  The 
first  sub-case  is  that  va,  bh  and  one  node  of  Px  U  P2  are  in  a 
nonseparable  subgraph  in  N.  Then  there  is  a  loop  that 
contains  all  three  nodes.  In  this  loop,  there  are  two  abso¬ 
lutely  disjoint  paths  each  of  which  contains  exactly  one 
node  of  b2  and  one  node  in  P,  u  P2.  Denote  the  path  that 
contains  va  by  La  and  the  other  by  Lh.  Now  in  P3,  there 
exists  a  sub-g-graph  such  that  it  contains  vc  and  exactly 
one  node,  say  o0,  in  KvLh.  Without  loss  of  generality,  we 
may  assume  that  v0  e  La.  Then  a  new  g-path  P3  can  be 
formed  by  the  union  of  the  g-path  of  P2  and  the  g-path 
between  v0  and  va  of  La.  Note  that  the  other  node  vh  of  b2 
is  connected  to  L  L2  by  the  path  Lh  which  is  absolutely 
disjoint  from  P2.  Thus  a  tree  T  can  be  obtained  by  adding 
enough  branches  to  the  union  of  bt,  b2,  Lh,  Px,  P2,  and  Pj. 
And  T-{bx,b2}  must  be  connected  when  nodes  in  M  are 
shorted  together  because  the  union  of  Lh,  Px,  P2%  and  Pj 
which  does  not  contain  f>,  or  b2  consists  of  exactly  three 
components  each  of  which  contains  at  least  one  node  in  M 
and  the  added  branches  do  not  alter  this  fact.  Therefore,  T 
satisfies  the  conditions  of  Theorem  2. 


The  remaining  sub-case  is  that  there  is  a  cut-node  v0 
between  nodes  in  P,  u  P,  and  { va,  vh).  From  Statement  (b). 
there  exist  two  absolutely  disjoint  g-paths  between  {va,  vh) 
and  St.  Let  P4,  P5  be  two  such  g-paths.  Because  i>0  is  a 
cut-node,  either  P4uP5  doesn’t  contain  any  node  of  P,UP: 
or  only  one  of  P4  and  P5  contains  nodes  of  Pt  U  P2  and  it 
must  also  contain  v0.  In  the  former  case,  it  can  be  shown 
by  arguments  similar  to  those  of  the  preceding  cases  that  a 
tree  obtained  by  adding  enough  branches  to  the  union  of 
b ,,  b2,  P,,  P;,  P4,  and  P5  satisfies  the  conditions  of  Theo¬ 
rem  2.  In  the  latter  case,  without  loss  of  generality,  assume 
that  P4  contains  u0,  va,  and  nodes  in  P,  U  P2.  Let  P4  be  the 
sub-path  of  P4  which  contains  va  and  exactly  one  node  in 
P,  U  P2.  Then  it  can  be  seen  that  the  union  of  bx,  b2.  P,, 
P2,  P4,  and  P5  contains  no  loop.  Therefore,  a  tree  in  .V  can 
be  obtained  by  adding  enough  branches  to  the  union.  And 
this  tree  satisfies  the  conditions  of  Theorem  2. 

Summing  up,  for  any  two  branches,  a  tree  T  can  be 
obtained  to  meet  the  conditions  in  Theorem  2.  Thus  by 
Theorem  2,  single-fault  testability  condition  is  satisfied  and 
Statement  (b)  is  proved.  Hence  the  proof  is  completed. 

Q.E.D. 

Comparing  Theorem  3  to  Theorem  2.  both  of  them 
provide  graphical  conditions.  However,  the  conditions  in 
Theorem  3  are  much  simpler  to  implement  and  they  only 
have  to  be  verified  once  for  every  inaccessible  node.  The 
latter  advantage  becomes  more  significant  when  N  is  large. 
The  applications  of  Theorem  3  will  be  illustrated  by  two 
examples  in  the  next  section. 

Examples 

To  demonstrate  the  versatility  of  the  above  results,  two 
examples  will  be  provided.  The  first  example  is  to  illustrate 
the  usage  of  Theorem  3  to  verify  the  single-fault  testability 
condition  while  the  second  one  will  extend  the  usage  of 
Theorem  3  to  the  choice  of  test  points  or.  equivalently, 
additional  accessible  nodes  to  satisfy  the  testability  condi¬ 
tion. 

Example  1:  Consider  a  ladder  network  N  as  shown  in 
Fig.  3(a)  with  three  accessible  nodes  a.  b,  and  c.  Construct 
the  associated  Na  by  connecting  these  three  accessible 
nodes  to  a  new  node  a.  as  shown  in  Fig.  3(b).  It  is  easy  to 
show  that  every  inaccessible  node  in  Na  has  exactly  3 
disjoint  paths  to  a.  Therefore,  by  Theorem  1  and  3,  N  is 
single-fault  testable.  In  fact,  stages  can  be  arbitrarily  in¬ 
serted  into  iV  without  destroying  the  testability. 

Example  2:  Consider  a  more  complicated  network  N ,  as 
shown  in  Fig.  4(a)  with  three  accessible  nodes  a.  b.  and  c. 
Its  associated  Na  is  shown  in  Fig.  4(b).  It  can  be  shown  that 
each  of  nodes  d .  e.  and  /  has  only  two  disjoint  paths  to  a. 
Thus  N  is  not  single-fault  testable.  To  achieve  testability, 
there  are  two  remedies.  The  apparent  one  is  to  introduce 
an  additional  accessible  node.  Any  one  of  the  nodes  d  and 
e  can  be  the  choice.  Fig.  4(c)  shows  N'a  with  the  additional 
accessible  node  d  which  can  be  shown  to  achieve  testa¬ 
bility.  Another  remedy  is  to  have  new  branches  inserted  in 
the  network.  This  insertion  can  be  either  permanent  if  the 
network  performance  can  be  compromised,  or  introduced 
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Fig.  3.  Network  of  Example  I . 


(10)  W.  K..  Chen.  Applied  Graph  Theory.  North-Holland,  1971. 
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Springer- Verlag,  1974. 
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Fig.  4.  Network  of  Example  2. 


only  during  the  testing.  Fig.  4(d)  shows  that  testability  can 
be  achieved  by  inserting  a  new  branch  p  between  nodes  g 
and  e. 
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ABSTRACT 

In  the  fault  diagnosis  of  linear  networks,  a 
method  which  requires  only  the  solving  of  linear 
equations  has  been  attracting  much  attention  { t- 
$].  This  particular  method  was  initiated  inde¬ 
pendently  by  Biernacki  and  Sandler  [11  and  Sakla, 
El-Marsy  and  Trick  [2).  Since  then,  numerous  use¬ 
ful  and  interesting  results  have  been  published  in 
a  rather  short  time.  It  is  the  purpose  of  this 
paper  to  provide  a  summary  in  this  area  of  re¬ 
search.  Furthermore,  an  example  is  given  to  show 
how  this  idea  can  be  "stretched"  to  apply  to  those 
cases  which  normally  would  not  be  applicable. 

This  is  achieved  by  use  of  multiple  frequency 
testing  and  fault-logic  equations. 

I .  A  Theory  of  Fault  Diagnosis 


i.e.,  apply  same  i,  to  the  original  network  N  and 
the  perturbed  netwoek  N+AN. 


Denote  Zn  and  partition  it  in  the  following 

way: 


Zn 


V1 


* 

z« 

Ztn 


(4) 


where  Zm  e  Cm*n,  Zjn  e  C**",  and  l«n-m.  Under 
this  condition,  it  can  be  shown  that 


Zmb  Jb  “  Avm  (5) 

where 

Zmb  ”  Z«m’A 

and 

Jb  *  -AYb  (vb*Avb>  (7) 


Let  N  be  a  connected,  b-branch,  (n-M)-node, 
(m*l) -terminal,  linear,  t irae- invariant  lumped  net¬ 
work.  Out  of  the  (n*l)  nodes,  (m*l)  are  accessi¬ 
ble  terminals  for  both  excitation  and  measurement . 
In  this  section,  the  following  notstions  are  used 
for  N. 

vm  -  terminal  (accessible  node)  voltage  vector. 
ia  -  terminal  current  vector. 
vb  -  branch  voltage  vector, 
ib  -  branch  current  vector. 

Yb  -  branch  admittance  matrix. 

Y„  -  node  admittance  matrix. 

Note  that  Yb  and  Yn  need  not  be  symmetrical,  and 

‘b  "  *b  vb  (1) 

Yn  -  A  Yb  A*  (2) 

where  A  is  the  incident  matrix  of  the  graph. 

Now  consider  that  the  network  N  is  perturbed 
to  (N*AN)  in  the  way  that  Yb  is  perturbed  to 
Yb+&Yb,  and  the  graph  remains  the  same.  In  the 
perturbed  network,  we  denote  the  corresponding 
voltages,  currents,  and  admittances  by  v^+av,,, 
im+iim*  *b*Avb,  ib+Aib.  •nd  *n**Yn>  respectively. 

Given  Yb  and  the  graph,  the  purpose  of  the 
fault  diagnosis  is  to  locate  the  nonzero  entries 
of  AYb  from  the  information  i^,  Ain,  and  Avm. 

Note  that  va  can  be  calculated  from  ig  and  N. 
Without  loss  of  generality,  we  may  assume  that 

it*  -  0  (3) 


In  the  above  equation,  a  row  of  AYb  is  entirely 
zero  if  and  only  if  the  corresponding  branch  is 
fault-free.  Hence,  a  nonzero  entry  of  jb  indi¬ 
cates  that  the  corresponding  branch  is  faulty. 

The  converse  is  also  true  if  the  rare  possibility 
of  cancellation  is  disregarded.  Since  in  Eq.  (5), 
both  ZBb  and  Ava  are  known,  we  can  solve  the  equa¬ 
tion  for  jb  and  thus  obtain  the  location  of  the 
faulty  branches.  The  usage  of  Eq.  (S)  to  locate 
the  branch  faults  is  called  branch  diagnosis  [1,2, 
4). 

Alternatively,  Eq.  (5)  to  (7)  can  be 
rearranged  into  the  following  equations: 

Zmn  3n  *  Avm 

where 

Jn  *  *  jb.  (9) 

In  Eq.  (9),  sn  entry  of  jn  is  nonzero  only  if  the 
corresponding  node  is  incident  with  st  least  one 
faulty  branch.  The  converse  is  almost  slways  true 
in  practicsl  cases.  Define  a  faulty  node  as  the 
one  incident  with  at  least  one  faulty  branch. 

Then  we  can  solve  Eq.  (8)  for  jn  to  locate  the 
faulty  nodes.  Once  the  fsulty  nodes  sre  located, 
the  faulty  branches  can  be  determined  from  Eq. 

(9).  This  approach  is  cslled  node  diegnoeis 
( 3 , S 1 .  Brsnch  diagnosis  snd  node  diagnosis  are 
the  two  major  approaches  to  locste  the  faults  by 
solving  linear  equations. 

There  is  a  unique  problem  shared  by  both 
approaches,  i.e.,  they  all  depend  on  solving  the 
equation  with  the  following  form: 
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where  the  entriea  of  the  matrix  B  and  the  vector  p 
art*  known  and  are  in  the  complex  field.  Moreover, 
Eq.  (10)  in  the  fault  diagnosis  uaually  haa  more 
unknown  variablaa  than  equationa.  Therefore,  the 
aolution  ia  not  unique  in  general.  Fortunately, 
in  moat  practical  caaea,  there  are  only  a  amall 
number  of  branch  faulta,  thua  a  different  but 
atill  amall  nuartier  of  faulty  nodea,  in  a  faulty 
network.  Taking  advantage  of  thia  fact,  we  can 
reatrict  the  aet  of  aolutiona  x  to  thoae  with  only 
a  amall  number  of  noncero  entriea,  and  then  the 
solution  can  be  unique.  Thia  will  be  diacuaaed 
next. 

Reconsider  Eq.  (10), 

B  x  ■  p  (10) 

where  x  e  C",  p  e  C",  B  c  C**"  and  n  >  m.  Let 

Xh  ■  (x  c  C"  |  number  of  non-zero  components 
of  x  not  more  than  than  k). 

We  want  to  investigate  the  uniqueness  of  the 
solution  x  c  Xk. 

Definition.  The  global  column-rank  of  B  ia 
aaid  to  be  r  if  every  combination  of  r  column a  of 
B  is  linearly  independent,  and  some  combination  of 
(r*l)  columns  of  B  ia  linearly  dependent. 

Let  a  be  the  range  of  B.  Then  the  following 
theorem  ia  given  in  (3]. 

Theorem  1 .  Let  p  c  Q.  Then  Eq.  (10)  haa  a 
unique  solution  x  t  Xk  for  elaust  all  p  e  9  if  and 
only  if  the  global  column-rank  of  B  is  at  least 
k*l. 

Note  that  for  fault  diagnoais  problem,  p  c  0 
is  guaranteed.  When  Theorem  1  ia  satisfied,  the 
solution  of  (10)  can  be  obtained  in  the  following 
way.  Let  the  coluam  vectors  of  B  be  denoted  by 

B  *  Ibj.bj . bn] 

Let  the  test  auitrices  Tj  be  constructed  in  the 
following  way: 

Ti  *  ^ijfbi^i*  *  *  ,  i  *  1,2,...,  a,  a  ■  C(n,k) 

for  every  combination  of  k  columns  of  B.  Then, 

(A)  If 

Rank  Tj  ■  k  ♦  1,  i  *  1,2, ...,a  (11) 

then  there  is  no  solution  x  c  Xk. 

(B)  Otherwise,  there  ia  a  unique  j  such  that 

Rank  Tj  •  k  (12) 

and 

Rank  Tj  •  1  ♦  1  ,  i  *  j.  (13) 

and  the  unique  solution  x  c  Xj,  is  given  by 


x  •  (BjTgj)-l  Bjp  (14) 

where  Bj  -  (b jj  ,bj2, . . .  ,bjkl . 

The  above  result  can  now  be  applied  to  the 
node- fault -equation  (8)  to  locate  uniquely  the 
faulty  nodea  if  the  number  of  faulty  nodes  ia 
strictly  leas  than  the  global  column-rank  of  Z^. 
The  application  to  the  branch  diagnosis  needs  more 
elaboration  and  can  be  found  in  [4].  Note  that 
thia  result  can  also  be  applied  to  any  other  fault 
equation  having  the  form  of  (10). 

The  remaining  big  question  is  that  whether  or 
not  the  theory  developed  above  is  robust,  i.e., 
whether  or  not  the  theory  based  on  the  nominal 
circuit  is  still  valid  for  the  actual  circuit 
whose  parameters  always  differ  from  those  of  the 
nominal  circuit.  The  answer  depends  on  how  the 
theory  is  implemented.  One  such  implementation 
will  be  presented  next.  Note  that  (11),  (12),  and 
(13)  are  equivalent  to  the  question  that  whether 
or  not  n  is  a  linear  combination  of  Bj  «  (b£j .bij, 
••••bik'-  The  question  haa  an  answer  in  the 
residual  number 

e 0  “  PT(I-Bi  Bj+)p 

where  Bj*  is  the  pseudoinverse  of  Bj.  It  is  known 
that  e°a0  if  and  only  if  p  is  a  linear  combination 
of  coluam  vectors  of  B:.  It  can  be  shown  that 
thia  test  ia  robust  (7).  This  implementation  was 
also  suggested  in  (1). 

An  important  development  should  also  be  noted 
that  the  global  column-rank  of  Z^,  and  Zm  can  be 
shown  that  it  depends  mainly  on  the  graph  of  N, 
not  on  the  circuit  parameters  [3,4]. 

So  far,  we  have  considered  the  case  with 
single  test  signal  and  with  single  frequency.  In 
the  next  section,  we  will  show,  by  an  example,  how 
the  theory  can  be  "stretched"  when  multiple  fre¬ 
quencies  are  used  and  how  a  solving  of  nonlinear 
equations  can  be  avoided  by  use  of  fault-logic 
equat ions. 

II .  Implementation  on  an  Active  Filter 

Consider  a  3-pole  low-pass  active  filter  as 
shown  in  Fig.  1.  We  are  to  locate  the  single 
faults  (resistors  and  capacitors)  in  this  network 
by  using  the  input  and  output  terminals  as  the 
only  accessible  nodes.  It  can  be  easily  shown 
that  the  global  coluam-rank  of  both  Z,^  and  Zm  ia 
one,  and  therefore  according  to  the  theory  the 
circuit  is  not  one-fault  diagnosible.  We  will 
show  how  to  use  aniltiple  frequency  and  fault-logic 
equationa  to  alleviate  the  problem. 

(Xir  algorithm  ia  hierarchical  in  essence, 
i.e.,  the  fault  ia  first  isolated  into  a  subnet¬ 
work  from  a  single  teat  signal  and  then  thia 
faulty  subnetwork  is  further  diagnosed  by  the 
usage  of  multiple  test  frequencies.  The  algorithm 
will  be  described  as  follows. 
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Step  Ai  The  circuit  is  first  reformulated 
into  a  block  diagram  as  shown  in  Fig.  2.  The  re¬ 
lationship  between  the  transfer  functions,  denoted 
by  A^'s  snd  the  R's  snd  C's  is  given  as  follows: 


A1  "  R1 
A2  ■  l/CSCj) 

A3  **  i/r2 

A4  ■  1/R7 

a5  -  -i/(c3+sc2) 

A6  -  *5/ (R4R4SC3) 


(15) 


1^  is  the  input,  and  and  Va  are  the  outputs. 

The  extra  signal  sources  j£,  i“l,2,...,6,  at  the 
output  of  the  blocks  Aj  is  the  fault  compensator 
of  the  block  A^.  The  meaning  of  the  fault  compen¬ 
sator  is  given  in  Fig.  3.  Note  that  if  Aj  is 
changed  to  (A^kAAj)  then  J^aU^AA{,  where  is  the 
input  to  Aj.  Assuming  that  UjSO,  we  have  Jj*0  if 
and  only  if  AAj*>0.  Our  first  step  is  to  locate 
non-zero  Jj's  from  Vj  and  Va. 


The  relation  between  (Vq,v2)  and  circuit  parame¬ 
ters  R's  and  C's  is  nonlinear.  In  order  to  avoid 
solving  nonlinear  equations,  we  propose  to  do  the 
following.  To  save  subscripts,  denote  Vq  by  V  and 
V2  by  W. 


Assume  that  Bj  ♦  ( B  j*AB  j  ) ,  i*t,2,3,4. 
Exciting  the  ciruit  by  k  different  frequencies, 
the  above  equation  becomes 


»  9 

•  9 

’ 

SiV,  S2Vj  S3Vj  -Wj 

ABj 

S1V1  s2v,  s3Vj  -W, 

®1 

S2V2  S2V2  S3V2  -W2 

as2 

S2V2  s22v2  s3v2  -w2 

»2 

•  • 

■ 

•  • 

•  e 

ABJ 

•  • 

»3 

SfcVk  s2vk  s3vk  -wk 

AB4 

SfcVfc  s2vk  s3vk  -wk 

B4 

k  k 

k  k 

4 

From  Fig.  2,  it  can  be  shown  that 


►  — 

»  • 

a 

AVj 

1  1  0  0  0  0 

J1 

J2 

flVo 

- 

0  A3A5A6Aq  A5A4AQ  A5A4AQ  A6Aq  ag 

J3 

J4 

J5 

k  S 

9  a* 

J6 

where  Ag«l/ ( I-A4A5A5) .  This  equation  has  the  form 
of  (10).  Therefore  Theorem  1  can  be  applied. 

Since  the  last  four  column-vectors  are  linearly 
dependent,  we  cannot  distinguish  among  (J^^.Jj, 
J$).  Let  the  last  four  column-vectors  be  grouped 
into  one.  Then  the  resulting  matrix  has  three 
columns  and  whose  global  column-rank  is  2.  There¬ 
fore,  it  is  one-fault  diagnosable,  i.e.,  we  may 
determine  whether  Aj,  A2  or  one  of  (A3 ,A4 ,A;  .A^) 
is  at  fault.  In  view  of  (15),  by  use  of  single 
signal  testing,  we  can  isolate  the  fault  into 
three  groups:  Rj ,  Cj  and  the  rest.  This  is 
established  in  Table  la. 


Step  B:  The  faulty  subnetwork  isolated  by 
Step  A  can  be  further  diagnosed  by  the  usage  of 
multiple  test  frequencies.  First,  observe  that 
the  only  part  that  requires  further  diagnosis  is 
the  group  of  (A3,A4,A^,A6).  It  csn  be  described 
by  the  transfer  function  of  Vq/V2  where  Vq  is  the 
output  voltage  of  the  circuit  and  V2  is  the  output 
of  A2  and  which  can  be  calculated  if  A2  is  fault- 
free.  This  can  be  determined  from  Step  A.  The 
expression  of  V0/v2  is  given  by 


v2  BjS*B2S2*B3S3 

where  S  is  the  complex  frequency,  and 
Bi  -  I/R7 

B2  -  (R4R$C3)/(R3R5)  (18) 

*3  ”  (BARbCzC^/Rs 
b4  •  1/R2 


We  need  to  determine  non-zero  ABj's  from  (19). 
Since  the  R.H.S.  is  known,  Eq.  (19)  has  the  form 
of  (10).  Therefore,  Theorem  1  again  can  be 
applied.  When  the  k  different  frequencies  are 
properly  chosen,  the  global  coluaxi-rank  of  the  L. 
H.S.  matrix  is  three.  Therefore,  we  can  uniquely 
determine  two  non-zero  ABj's.  From  these  non¬ 
zero  ABj's,  the  faulty  circuit  parameters  can  be 
located  from  (18)  by  a  logic  argument.  This  can 
be  presented  formally  as  follows. 

From  (18),  a  set  of  fault-logic  equation  is 
constructed: 

F(Bj)  -  F(R7) 

F(B2)  -  F(R3)  V  F(R4)  V  F(R5)  V  F(R6)  V  F(C3) 
f(b3)  -  F(C2)  V  F(R4)  V  f(r5)  V  f(r6)  V  f(c3) 

F(B4>  -  F(R2)  (20) 

The  first  fault-logic  equation  reads  that  ABj  is 
non-zero  if  and  only  if  AR 7  is  non-zero.  The  sec¬ 
ond  fault-logic  equation  reads  that  AB2  is  non¬ 
zero  if  and  only  if  AR3,AR4,AR5,AR«,  or  AC3  is 
non-zero.  Similarly  for  the  third  and  the  fourth 
fault  logic  eqations.  We  have  assumed  that  no  two 
faults  wll  occur  in  such  a  way  that  their  effects 
will  cancel  each  other,  which  is  rare  in  practice. 

Finally,  we  need  to  solve  the  set  of  fault- 
logic  equations  (20).  This  can  be  done  by  use  of 
a  truth  value  table.  The  solution,  together  with 
the  solution  in  Step  A,  is  given  in  Table  1.  An 
entry  0  means  that  AAj*0  in  Table  la  and  ABjSO  in 
Table  lb.  An  entry  X  means  AAj»0  in  Table  la  and 
ABj"0  in  Table  lb.  For  example,  the  second  row  of 
Table  la  resda  that  if  AA2*0,  and  AAi*AA3'*AA4<>AA3" 
AA$“0  then  Cj  is  faulty.  The  fifth  row  of  Table 
lb  reads  that  if  AB2«0,  AB3#0,  and  ABj"AB4"0  then 
a  fault  occurs  among  (84 .Rj.R^.Cj).  The  reason 
that  the  last  group  of  circuit  parameters  cannot 
be  further  distinguished  can  be  easily  found  from 
the  circuit.  They  are  related  as  a  single  product 
and  therefore,  they  cannot  be  further  diagnosed 
unless  additional  test  points  are  introduced. 


>  . 
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This  iap learnt at  ion  has  many  interesting 
points: 

1.  Tha  fault  diagnosis  decision  is  based  on 
■etching  fault  patterns  (Table  1).  It  is  like  the 
fault  dictionary  approach  for  fault  diagnosis  of 
digital  systems.  It  is  extreswly  simple  from  the 
user's  point  of  view.  Such  table  is  called  fault 
directory  (8  J. 

2.  The  post-fault  computation  is  miniaul. 

The  only  computation  required  is  the  calculation 
of  residuals.  Its  complexity  depends  on  the 
number  of  faults,  not  the  sise  of  the  network. 

3.  The  use  of  multiple  frequency  and  fault- 
logic  equation  avoids  a  complex  and  non-robust 
computation  of  solving  a  set  of  nonlinear  equa¬ 
tions. 

4.  Our  computer  sisxilation  study  [8]  shows 
that  this  method  of  isqilementat ion  is  very  robust. 
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A1 

a2 

a3 

*4 

a5 

a6 

Faulty  Components 

0 

X 

X 

X 

X 

X 

*1 

X 

0 

X 

X 

X 

X 

C1 

X 

X 

0 

0 

0 

0 

Others 

Table  la.  Fault  Patterns  for  Step  A. 


*1  *2  b3  b4 

Faulty  Components 

0  X  X  X 

*7 

X  0  X  X 

«3 

X  X  0  X 

c2 
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0.  THE  BACKGROUND 

During  the  past  quarter  century,  the  engineering  community  has  been  wit¬ 
ness  to  tremendous  strides  in  the  art  of  electronics  design.  On  the  contrary, 
electronics  maintenance  has  changed  little  since  the  day  of  the  vacuum  tube. 

As  such,  our  ability  to  design  a  complex  electronic  circuit  is  quickly  out¬ 
distancing  our  ability  to  maintain  it.  In  turn,  the  price  reductions  which 
have  accompanied  modern  electronics  technology  have  been  paralleled  by  in¬ 
creasing  maintenance  and  operation  costs.  Indeed,  many  industries  are  finding 
that  the  life  cycle  maintenance  costs  for  their  electronic  equipment  now  ex¬ 
ceeds  their  original  capitol  investment. 

Given  the  above,  it  is  quickly  becoming  apparent  that  the  electronics 
maintenance  process,  like  the  design  process,  must  be  automated.  Unfortunate¬ 
ly,  the  50  years  of  progress  in  circuit  theory,  on  which  our  electronics  de¬ 
sign  automation  has  been  predicated,  does  not  exist  in  the  maintenance  area. 

As  such,  the  past  decade  has  witnessed  the  inauguration  of  a  basic  research 
program  to  lay  the  foundations  for  a  theory  of  electronics  maintenance  and  a 
parallel  effort  to  develop  operational  electronic  maintenance  codes. 

Thus  far  the  greatest  success  has  been  achieved  in  the  digital  electron¬ 
ics  area  to  the  point  that  commercialized  test  programs  are  now  readily  avail¬ 
able.  On  the  other  hand,  the  analog  testing  is  still  in  its  infancy.  This  is 
not  without  reasons. 

JE  .  ■  . - -  —  -  ■  '  — —  ■ 
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For  one  reason,  the  analog  fault  diagnosis  had  a  late  start.  The  re¬ 
search  and  theory  development  of  digital  testing  started  in  the  mid  1960 's 
when  the  large-scale  computers  were  readily  available.  Not  until  a  decade 
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later  did  a  commercialized  test  program  first  become  available.  On  the  other 
hand,  it  was  not  until  the  mid  1970's  that  the  test  technology  community  began 
to  face  up  to  the  analog  test  problem.  Indeed,  even  in  a  predominantly  digi¬ 
tal  world,  analog  systems  were  not  disappear ing.  Analog  systems  were  proving 
to  be  among  the  most  unreliable  and  least  readily  tested  of  all  electronic 
systems.  Assuming  the  same  speed  for  the  development  of  digital  testing,  a 
commercially  available  analog  testing  program  would  not  have  been  ready  until 
the  mid  1980's. 

The  main  reason  is  that  the  analog  fault  diagnosis  has  inherited  certain 
difficult  problem#  which  are  not  shared  by  digital  fault  diagnosis.  These 
will  be  explained  later. 

I.  INTRODUCTION 

One  can  trace  as  far  back  as  the  early  1960's  to  find  that  circuit 
theorists  had  an  interest  in  the  analog  fault  diagnosis  problem  [1,2]. 

However,  there  was  only  sparse  interest  thereafter  [3-9].  It  was  not  until 
1977  that  a  collection  of  papers  [10]  appeared,  and  a  special  issue  followed 
[11].  In  the  meantime,  the  interest  among  circuit  theorists  suddenly  became 
active  [12-52],  An  excellent  review  paper  with  extensive  references  for  the 
pre-1979  period  appeared  in  the  special  issue  [12]. 

In  this  paper,  only  the  post-1979  activities  will  be  discussed.  The 
Fault /Tolerance  Compensator  approach  will  be  extensively  discussed  because  it 
is  probably  the  most  promising  approach  at  the  present  time,  (based  on  the 
authors'  prejudiced  point  of  view?) 
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As  a  start,  a  simplified  analog  fault  diagnosis  problem  is  presented  in 
Section  II  and  some  major  issues  are  discussed  in  Section  III.  The  Element- 
Value  Solvability  Problem  is  discussed  in  Section  IV.  A  fault/tolerance  com¬ 
pensation  model  is  presented  in  V.  The  k-Fault  Diagnosis  Problem  is  the  ideal 
case  (Section  VI)  and  the  tolerance  case  (Section  VII)  follows.  A  conclusion 
is  given  in  Section  VIII. 
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Since  the  constraint  m  <<  n  is  imposed  upon  us ,  we  have  to  ask  the  next 
realistic  question.  Suppose  that  a  perturbation  (y  +  Ay)  is  observed  at  the 
output  and  we  want  to  locate  which  entries  of  A  have  been  perturbed.  In  this 
case,  it  is  quite  realistic  to  assume  that  the  number  of  entries  of  A,  say  k, 
were  perturbed  at  an  instant  when  a  perturbation  was  observed  at  the  output  is 
small  (k  «  n).  In  other  words,  AA  is  a  sparse  matrix.  Furthermore,  we  need 
only  to  determine  the  location  of  non-zero  entries  of  A,  not  their  values.  We 
can  now  pose  the  second  problem. 

Problem  2:  (k-Fault  Diagnosis:  The  ideal  case) 

With  the  constraint  ra  <<  n,  and  the  assumption  that  AA  is  sparse,  can  we 
determine  the  location  of  non-zero  entries  of  AA  from  the  input/output 
measurements? 

It  turns  out  that  if  certain  conditions  are  met,  we  can  uniquely  determine  the 
non-zero  rows  of  AA.  (Fortunately,  this  information  is  enough  for  us  to 
locate  the  faulty  circuit  elements.)  This  can  be  seen  as  follows. 

Subtracting  (1)  from  (2),  one  obtains 

CA”*  AA(x  +  Ax)  *  -Ay  (3) 

or 

CA"1  Z  -  -Ay  (4) 

and 

Z  -  AA(x  +  Ax)  (5) 

Note  that  since  CA-1  is  fat  (m  <  n) ,  the  determination  of  AA  from  (3)  is  not 
possible.  On  the  other  hand,  barring  cancellations,  it  can  be  seen  from  (5) 
that  a  component  of  Z  is  non-zero  if  and  only  if  the  corresponding  row  of  AA 
is  non-zero.  Therefore,  the  problem  is  reduced  to  the  location  of  non-zero 
components  of  Z  from  (4)  when  the  matrix  CA"1  is  fat.  A  satisfactory  solution 
to  this  problem  can  be  found  in  [30],  and  it  will  be  presented  later. 
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In  the  final  case,  the  problem  of  tolerance  will  be  introduced.  In 
practice,  System  (1)  can  not  be  built  precisely  according  to  the  specified  A, 
i.e.,  the  nominal  A.  A  specified  tolerance  usually  is  also  given  so  that  an 


actual  system  (A  +  AAt)  is  said  to  be  "good"  if  AAt  is  within  the  tolerance. 

In  this  case,  the  perturbation  AA  will  have  two  components, 

AA  ■  AA^  +  AAf  (6) 

where  AAt  is  the  deviation  of  actual  A  from  nominal  A  due  to  the  tolerance, 
and  AAf  is  due  to  some  large  deviation  (fault)  of  some  entries  of  A.  It  is 
realistic  to  assume  that  AAf  is  not  sparse  but  the  value  of  each  entry  is 
"small"  and  that  AAf  is  sparse  but  the  values  of  non-zero  entries  are  "large". 
Now,  we  can  state  the  third  problem. 

Problem  3.  (k-Fault  Diagnosis:  The  tolerance  case) 

With  the  constraint  ra  <<  n,  and  the  assumption  that  AAf  is  small  and  AAf 
is  sparse  but  non-zero  entries  are  large,  can  we  locate  the  non-zero 
entries  of  AAf  from  the  input/output  measurements? 

The  above  problem  can  be  posed  in  two  different  kinds  of  setting. 

1.  The  Problem  of  Robustness .  We  may  design  a  testing  program  (for 
k-fault  isolation)  based  on  the  ideal  case,  and  then  put  the  testing  program 
into  a  simulation  test  to  see  if  this  program  is  robust  under  the  tolerance 
case.  This  is  the  state-of-the-art  at  the  present  time.  It  turns  out  that  to 
design  a  robust  testing  program  is  a  very  tough  problem.  For  one  thing,  any 
robust  testing  program  should  avoid  the  inversion  of  a  matrix,  or  keep  it  at  a 
minimum. 

2.  The  Fault  Decision  Problem.  This  tolerance  problem  can  also  be  posed 
as  one  of  decision/detection  problem.  This  can  be  seen  as  follows.  Let 

Zf  *  AAf(x  +  Ax) 

Zf  *  AAf (x  +  Ax) , 
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Chen  Eq.  (4)  can  be  represented  in  Fig.  1.  In  chis  figure,  is  generally 
small  while  the  non-zero  entries  of  Zf  are  generally  large,  but  sparse.  The 
problem  is  to  locate  the  non-zero  entries  of  Zf  from  Ay.  This  is  a  non- 
conventional  dec  is ion/ detect ion  problem.  At  the  present  time,  it  is  still  an 
open  problem. 

At  this  point,  it  is  important  to  notice  that  when  the  number  of  test 
points  is  reduced,  the  nature  of  the  problem  changed  completely  from  Problem  1 
to  Problem  2;  and  that  when  the  tolerance  problem  is  introduced,  the  issue  of 
computation  becomes  more  complex. 

Finally,  note  that  what  has  been  presented  is  a  simplified  analog  fault 
diagnosis  problem.  In  general,  the  matrix  A  is  a  (nonlinear)  function  of 
circuit  parameters  as  well  as  frequencies. 


III.  IMPORTANT  ISSUES  OF  ANALOG  FAULT  DIAGNOSIS 
In  this  section,  we  will  discuss  two  major  issues  of  analog  fault 
diagnosis  problems  and  three  important  measures  of  the  effectiveness  of  a 
testing  program. 

The  two  major  issues  are  the  tolerance  problem  and  the  problem  of 
modeling  and  simulation  of  faulty  components: 

•  Tolerance:  Possibly  the  single  greatest  unknown  in  the  design  of  an 
analog  testing  program  is  the  effect  of  the  tolerances  of  the  "good" 


component  on  the  performance  of  a  testing  program.  This  tolerance  problem 
has  absolutely  no  counterpart  in  the  digital  testing  problem.  The  effect 
of  these  tolerances  can  completely  dominate  the  performance  of  a  testing 
program.  In  an  analog  circuit,  unlike  digital  circuits,  the  actual  values 
of  circuit  parameters  almost  always  deviates  from  the  nominal  values. 
Therefore,  any  analog  testing  program  has  to  face  up  to  the  problem  of 


tolerance  problem. 
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•  Modeling  and  Simulation  of' Faulty  Components:  Unlike  the  digital  testing, 
a  complete  modeling  (and  thus  simulation)  of  faulty  components  is  not 
available  for  the  development  of  a  testing  problem.  The  modes  of  faulting 
is  too  many  to  encounter.  For  example,  a  faulty  resistor  may  have  an 
infinite  number  of  possible  resistances  (outside  of  the  tolerance).  In 
fact,  it  can  even  be  nonlinear.  A  faulty  capacitor  may  have  a  model  of 
parallel  RC.  A  faulty  operational  amplifier  may  have  a  model  of  22 
transistors  12  resistors  and  a  capacitor!  A  good  transistor  may  behave 
like  a  faulty  one  if  its  bias  is  switched  due  to  a  fault  which  occurred 
elsewhere!  In  fact,  in  a  nonlinear  analog  environment,  we  are  still  in 
the  process  of  developing  viable  CAD  models  for  nominal  devices,  let  alone 
for  faulty  devices.  As  such,  a  thorough  test  of  the  performance  of  a 
testing  program  is  impossible.  Furthermore,  each  testing  program  has  to 
be  designed  based  soley  on  the  nominal  values  of  the  circuit. 

Besides  the  two  major  problems  mentioned  above,  there  are  three  important 
measurements  of  the  effectiveness  of  a  testing  program. 

•  Test' Points.  Due  to  the  practical  restriction  that  there  are  usually  only 
a  few  nodes  accessible  for  measurement  and  testing,  the  number  of  required 
test  points  has  to  be  as  small  as  possible . 

•  Post-Faulf  Computation.  Since  the  post-fault  computation  is  directly  re¬ 
lated  to  the  per  unit  cost,  _i£  important  to  keep  the  post-fault  compu¬ 
tation  time  short  and  simple. 

•  Robustness .  This  issue  has  been  raised  many  times;  only  because  it  is  in¬ 
deed  the  dominant  issue  at  hand.  We  want  the  testing  program  to  be  relia¬ 
ble  when  the  fault/tolerance  ratio  is  small. 


IV.  THE 


•VkUM-  SOLVABILITY  PROBLEM 


ELEMENT- 

The  element-value  solvability  problem  is  initiated  by  R.S.  Berkowitz  [1] 
in  1962.  This  problem  is  presented  in  Fig.  2.  The  network  N  consists  of  RLC- 
elements,  possibly  with  dependent  sources.  There  are  m  accessible  terminals. 
The  problem  is  to  determine  the  circuit  parameters,  RLC,  from  the  measurements 
at  the  accessible  terminals. 

It  is  easy  to  see  that  at  the  terminals  one  can  at  most  measure  the  m- 
port  input  impedance  matrix  H(S)  and  this  can  be  done,  for  example,  by  system 
identification  methods.  The  transfer  function  matrix  is  a  function  M  of  fre¬ 
quency  S  and  circuit  parameter  p,  i.e., 

M(S,p)  -  H(S) 

The  problem  is  to  solve  for  p  from  H(S).  In  other  words,  it  is  a  m-port 
network  synthesis  problem  when  the  graph  and  the  element-kind  is  prescribed. 

In  general,  M  is  a  nonlinear  function  of  p.  As  a  rule  of  thumb,  when  the 
number,  ra,  of  accessible  nodes  (test  points)  increases,  M  becomes  "less  non¬ 
linear",  and  vice  versa.  This  is  an  important  trade-off  problem.  In  this  re¬ 
gard,  there  are  two  standard  results  to  be  quoted  constantly  as  a  measuring 
stick.  Trick,  Mayeda  and  Sakla  [15]  have  shown  that  if  all  nodes  are  accessi¬ 
ble  then  M  becomes  linear  and  p  can  be  uniquely  determined. 

On  the  other  hand,  for  a  one-port  RC-ladder,  their  element  values  can 
always  be  determined,  regardless  of  the  number  of  stages.  This  follows  from  a 
well-known  RC  synthesis  theorem.  This  example  shows  that  there  exists  a 
circuit-type  that  regardless  how  large  the  circuit  is,  the  circuit  parameters 
can  be  determined  from  only  two  accessible  nodes.  However,  in  this  case  M  be¬ 
comes  "extremely  nonlinear".  To  see  this,  a  4-stage  RC  ladder  is  studied 


[27].  The  continue-fractional  expansion  method  is  used  to  determine  the  cir¬ 
cuit  values.  A  striking  result  has  been  found.  When  the  significant  digits 


used  for  computation  is  eight,  the  answer  came  out  correctly.  However,  when 
it  is  reduced  to  seven,  the  answer  becomes  erroneous.  In  fact,  some  values  of 
R  and  C  become  negative!  This  shows  that  the  RC  ladder  is  solvable  in  theory, 
but  it  will  yield  erroneous  conclusions  even  with  a  slightest  computation 
error.  Therefore,  it  becomes  unreliable. 

In  between  the  above  two  extreme  cases  of  the  computation-test  point 
trade-off,  the  most  elegant  result  is  found  by  Navid  and  Willson  [13].  First, 
their  solvability  condition  is  on  the  topology  not  on  the  element  values. 
Therefore ,  it  can  be  tested  very  easily.  Second,  the  number  of  test  points 
required  is  not  too  large,  but  roughly  the  square  root  of  the  number  of 
branches.  Finally,  it  is  computationally  tractable,  although  a  set  of  non¬ 
linear  equations  has  to  be  solved. 

A  necessary  and  sufficient  condition  on  local  diagnosability  of  nonlinear 
circuits  is  elegantly  derived  by  Visvanathan,  Sangiovanni-Vincentelli  [39]  and 
Saeks  et  al.  [40],  This  is  an  important  contribution  because  it  provides  a 
theoretical  limit.  It  will  be  most  helpful  to  see  if  such  a  condition  can  be 
made  robust  in  the  presence  of  tolerance. 

In  summary,  the  advantage  of  the  element  value  solvability  method  is  that 
it  avoids  the  tolerance  problem  because  all  parameters  are  calculated  and  can 
be  compared  to  see  if  they  fall  within  the  tolerance.  The  method  also  can  be 
applied  to  the  case  when  the  number  of  faulty  elements  is  large.  The  major 
difficulty  with  this  method  is  that  they  have  to  solve,  in  general,  a  set  of 
nonlinear  equations  every  time  they  are  tested.  The  number  of  nonlinear 
equations  to  be  solved  is  no  less  than  the  number  of  internal  parameters, 
which  in  general  is  large.  In  order  to  make  this  method  work,  we  need  a 
computational  procedure  which  is  robust  for  a  large  number  of  nonlinear 


equations. 
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V.  A  fault/tolerance  compensation- model 

In  this  section,  a  model  is  introduced  which  can  be  used  effectively  for 
fault  diagnosis  problems.  This  can  be  simply  explained  in  Fig.  3. 

Suppose  that  the  actual  admittance  (Fig.  3b)  of  particular  branch  is 
deviated  from  its  nominal  admittance  y  (Fig.  3a)  by  Ay.  The  deviation  may  be 
caused  by  the  tolerance  (in  this  case  Ay  is  small),  or  by  a  fault  (in  this 
case  Ay  is  large).  According  to  a  circuit  theory,  the  deviation  can  be  com¬ 
pensated  by  a  current  source  j  which  depends  on  the  deviation  Ay  and  its 
branch  voltage.  This  is  true  whether  Ay  is  linear  or  nonlinear  (in  this  case, 
a  small  signal  is  assumed) .  The  value  of  j  in  general  is  a  complex  number 
(even  when  y  is  real)  and  it  is  "large"  if  the  deviation  is  caused  by  a  fault, 
and  it  is  "small"  if  by  a  tolerance.  Hence,  the  fault  diagnosis  problem  be¬ 
comes  a  detection  problem  of  whether  j  is  large  or  small. 

The  same  is  true  for  a  three-terminal  device  as  shown  in  Fig.  4.  Here  Y 
is  a  2x2  admittance  matrix.  We  need  two  current  sources  to  compensate  the 
deviations.  If  the  device  is  a  "good"  transistor,  Y  is  its  linear  model  and 
the  compensators  represent  the  errors  caused  by  the  linear  approximation  to 
the  nonlinear  transistor.  If  the  device  is  a  "faulty"  operational  amplifier, 
the  compensators  may  represent  the  large  deviations  of  the  gain,  or  even  the 
deviations  caused  by  its  internal  transistors  and  capacitors.  In  fact,  this 
model  can  be  applied  to  any  three-terminal  device  or  any  three-terminal 
sub-circuit . 

In  general,  an  n-terminal  device  (chip)  needs  (n-1)  current  sources  to 
compensate  its  deviations.  When  all  actual  devices  are  replaced  by  their  com¬ 
pensation  model,  we  have  a  Fault/Tolerance  Compensation  Model  for  the  actual 
circuit.  In  this  model,  it  consists  only  of  the  nominal  circuit  excited  by 
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actual  excitations  and  by  fault/ tolerance  compensators.  Therefore,  our  prob¬ 
lem  is  to  locate  the  fault  compensators. 

Consider  the  circuit  in  Fig.  2a,  with  (m  +  1)  accessible  nodes  and  b 
branches.  Its  Fault/Tolerance  Compensation  Model  is  given  in  Fig.  2b,  where 
Jb  is  the  F/T  compensator  vector.  When  these  two  circuits  are  excited  by  the 
same  and  the  responses  Vm  and  (Vm  +  AVm)  are  measured  respectively,  it  is 
easy  to  show  that 

^  Jb  -  -AVm  (7) 

where  Z,^  is  the  branch-to-accessible  node  transfer  function  matrix,  depending 
only  on  the  nominal  circuit. 

Equation  (7)  was  first  derived  by  Biernacki  and  Bandler  [37],  based  on 
linear  perturbations.  By  use  of  the  Fault/Tolerance  Compensation  model,  we 
have  shown  that  Eq.  (7)  is  also  valid  for  linear  nominal  circuit  with 
nonlinear  perturbations.  In  fact,  the  perturbations  can  be  from  R  to  parallel 
RC,  or  from  an  operational  amplifier  to  a  complex  circuit  of  (nonlinear) 
transistors  and  capacitors.  Therefore,  the  problem  of  modeling  faulty  devices 
can  be  avoided  as  long  as  its  nominal  model  is  linear. 

Finally,  the  compensator  Jb  has  two  components: 

Jb  "  JbF  +  JbT  (8) 

where  Jbp  is  the  fault  compensator  and  Jbx  is  the  tolerance  compensator.  Jbp 
is  sparse  but  non-zero  entries  are  large  while  JbT  is  small  but  unknown  to  us. 
The  problem  becomes  the  determination  of  the  non-zero  entries  of  Jbp  from  (7) 
and  (8).  This  will  be  discussed  in  the  next  two  sections. 

VI.  k-FAPLT  DIAGNOSIS: -  The  Ideal  Case 

In  this  section,  we  assume  that  the  number  of  accessible  nodes  m  is  much 
smaller  than  the  total  number  of  nodes  n,  i.e.,  m  <<  n.  We  further  assume 
that  the  number  of  faulty  branches  is  much  smaller  than  the  total  number  of 


branches,  i.e.,  k  <<  b.  The  nominal  circuit  is  linear  and  it  can  be  repre¬ 
sented  by  Eqs.  (7)  and  (8).  Finally,  we  assume  that  J^p  *  0.  In  other  words, 
we  want  to  locate  the  non-zero  entries  of  from  (7),  where  Jfc  is  sparse. 

A  solution  to  this  problem  can  be  found  in  [49],  which  will  be  presented  as 
follows . 

Consider  the  following  equation: 

B  x  «  p  (9) 

where  xe  Cn,  pe  (7°,  Be  CP1*11  and  n  >  m.  Let 

Xfc  *  {x  e  Cn  |  number  of  non-zero  components 
of  x  not  exceeding  k} . 

We  want  to  investigate  the  uniqueness  of  the  solution  x  e  . 

Definition .  The  global  column-rank  of  B  is  said  to  be  r  if  every  combi¬ 
nation  of  r  columns  of  B  is  linearly  independent,  and  some  combination  of 
(r+1)  columns  of  B  is  linearly  dependent. 

Let  SI  be  the  range  of  B.  Then  the  following  theorem  is  given  in  [49]. 
Theorem' 1 .  Let  p  eft.  Then  Eq.  (9)  has  a  unique  solution  x  e  X^  for 
almost  all  p  e  SI  if  and  only  if  the  global  column-rank  of  B  is  at  lea^t  k+1. 

Note  that  for  fault  diagnosis  problem,  p  e  SI  is  guaranteed.  When  Theorem 
1  is  satisfied,  the  solution  of  (9)  can  be  obtained  in  the  following  way.  Let 
the  column  vectors  of  B  be  denoted  by 

B  -  (b1,b2 . bn] 

Let  the  test  matrices  T^  be  constructed  in  the  following  way: 

Ti  m  [ Bj, j » b ^2 » •  •  •  ,p ] ,  i  *  1 ,2, . . .  ,s  ;s  a  C(n,k) 

for  every  combination  of  k  columns  of  B.  Then, 

(A)  If 

Rank  Tj  ■  k  +  1,  i  ■  1,2,. ..,s  (10) 
then  there  is  no  solution  x  e  X^. 
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(B)  Otherwise,  there  is  a  unique  j  such  that 

Rank  Tj  -  k  (11) 

and 

Rank  T£  ■  k  +  1  ,  i  *  j.  (12) 

Furthermore,  the  unique  solution  x  e  is  given  by 

x  -  (BjTBj)_1  BjP  (13) 
where  Bj  *  [bjx »bj2, • . . ,bjkl . 

As  a  consequence,  the  k-fault  diagnosability  depends  on  the  global 
column-rank  of  ^5.  Let  us  now  examine  the  issue  of  (1)  post-fault  computa¬ 
tion  and  (2)  the  number  of  required  test  points. 

1.  Post-Faolt ~ Computat ion .  The  post-fault  computation  includes  only 

Eqs.  (10-13)  It  involves  the  test  of  the  rank  of  mx(k+l)  matrices  Tj,  and  an 

inversion  of  a  kxk  matrix  (Bj^Bi).  This  is  a  very  good  feature  since  both  m 

and  k  are  small.  More  importantly,  the  size  of  these  matrices  does  not  depend 
on  the  complexity  of  the  circnit  b.  The  total  post-fault  computation  can  be 
implemented  on  a  mini-computer  with  computation  time  ranging  from  a  fraction 
of  a  second  to  the  order  of  a  few  seconds. 

2.  Number  of- Reqoired' Test' Points .  For  k-fault  diagnosis,  it  is  neces¬ 
sary  for  the  number  of  required  test  points  ra  >  k  +  1.  It  is  important  to 

note  that  ^t_  again  depends  mainly  on  k  not  on  b~,  the  complexity  of  the 

circuit . 

As  an  example,  assuming  the  optimal  condition  m  ■  k  +  1,  for  a  single¬ 
fault  diagnosis,  we  need  only  two  test  points,  testing  the  rank  of  2x2 
matrices  and  an  inverting  an  lxl  matrix,  regardless  of  how  large  b  is. 

Next,  we  will  investigate  how  the  optimal  condition  m  *  k  +  1  can  be 
achieved.  This  requires 

global  column-rank  Zg^  *  o  *  k  +  1  (14) 
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This  depends  on  how  the  circuit  is  designed  and  more  importantly  how  the  test 
points  are  located.  It  turns  out  surprisingly  that  the  global  column-rank  of 
Z^  depends  mainly  on  the  topology  of  the  circuit,  not  its  element  values. 

3.  Topological ' Conditions'  for' k-Faolt  Piagnosibility.  First,  it  can  be 
shown  that  if  there  is  any  internal  loop  (loops  not  incident  with  test  nodes) 
consisting  of  r  branches  than  the  global  column-rank  of  Z^  cannot  be  greater 
than  (r-1).  Therefore,  we  can  diagnose  at  most  ( r-2)-faults .  Since  most 
circuits  have  loops  consisting  of  three  branches,  we  can  diagnose  up  to  only 
one-fault.  This  is  a  serious  limitation.  Alternatively,  we  can  work  on  the 
node-fault  diagnosis  equation 

Zmn  Jn  »  (15) 

where  Jn  =  A  Jj,  is  the  node  compensator  vector,  A  and  A  the  incident 

matrix.  Theorem  1  can  now  be  applied  to  (15).  In  this  case,  a  topological 
condition  has  been  derived  independently  by  Huang,  Lin  and  Liu  [49]  and  Togawa 
and  Matsumoto  [51].  First,  construct  a  testing  graph  Gt  from  the  given  graph 
G  by  (1)  deleting  all  branches  which  are  incident  between  two  accessible  nodes 
and  (2)  connecting  all  accessible  nodes,  except  the  reference  node,  to  a  new 
node  t. 

Theorem* 2 .  Let  the  network  be  passive  and  G  be  connected.  The  following 
three  statements  are  equivalent: 

(A)  The  global  column-rank  of  Z^  is  k  for  almost  all  branch 
admittances . 

(B)  The  local  connectivity*  between  the  node  t  and  any  inaccessible  node 
in  Gt  is  k. 

(C)  There  are  at  least  k  independent  paths*  in  Gt  from  any  inaccessible 


node  to  the  node  t . 
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The  topological  conditions  for  the  global  column-rank  of  equal  to  2 
is  given  in  [50].  A  more  general  topological  condition  is  obtained  in  [52]. 
These  topological  conditions  are  very  useful.  (1)  They  provide  a  very  easy 
evaluation  of  the  global  column-rank  of  a  transfer  function  matrix.  (2)  They 
provide  a  foundation  for  the  design  of  diagnosable  circuits  and  (3)  They 
provide  the  foundation  for  the  design  of  locations  of  test  points. 

VII.  k- FAULT  DIAGNOSIS:  The  Tolerance  Case 
The  fault  diagnosis  equation  for  the  tolerance  case  is  given  by  (7)  and 

(8) 

Zmb^bF  +  JbT>  “  “AVm  (16) 

which  can  be  shown  to  have  the  form 

<Zmb  +  *Zmb>JbT  «  -&Vm  (17) 

where  AZ^  is  caused  by  the  tolerance.  This  can  be  viewed  as  the  same  problem 
as  the  ideal  case  except  that  the  matrix  Zjq],  is  polluted  by  the  noise  AZ^. 
Therefore,  we  need  robust  computational  methods  for  the  testing  procedures 
(10-13).  A  well-known  method  for  the  test  of  the  rank  of  a  noisy  matrix  is 
the  singular  value  decomposition  method  [53].  Unfortunately  this  method  is 
unreliable  for  the  test  of  the  global  column-rank  of  a  noisy  matrix.  There 
exists  examples  where  the  singular  value  decomposition  method  fails  [55].  In 
order  to  circumvent  this  problem,  Suen  and  Liu  have  developed  a  method  using 
residual  numbers  for  the  testing  of  global  column-ranks  [55].  The  same  method 
can  also  be  used  for  the  tests  (10-12)  [54,  55].  For  all  the  cases  we  have 
tried,  this  method  has  shown  to  be  very  reliable  when  the  ratio  fault/toler¬ 
ance  is  large. 


VIII.  CONCLUSION 


Two  main  approaches  to  analog  fault  diagnosis  has  been  presented:  The 
element-value  solvability  problem  and  the  fault/tolerance  compensation 
approach . 

The  element-value  solvability  problem  is  the  same  as  the  n-port  synthesis 
problem  except  that  the  graph  and  the  element-type  have  been  prescribed. 
Therefore,  it  is  very  much  in  the  interest  of  circuit  theorists.  The  major 
advantage  of  this  approach  is  that  the  tough  problem  of  tolerance  is  avoided. 
The  difficulty  of  this  approach  is  that  its  performance  is  limited  by  a  trade¬ 
off  between  the  large  number  of  test  points  required  and  the  complexity  of 
post-fault  computation. 

The  fault/tolerance  compensation  approach  requires  a  very  few  number  of 
test  points  and  very  simple  post-fault  computation.  However,  the  number  of 
faulty  devices  is  limited  to  be  small.  More  importantly,  we  have  to  face  up 
to  the  tolerance  problem,  although  some  limited  success  in  this  direction  has 
been  accomplished. 

In  summary,  in  this  uncharted  area  of  research  we  have  been  struggling  in 
the  past  to  find  out  what  the  real  problems  are,  and  we  begin  to  see  some 
daylight.  It  still  has  a  long  way  to  go. 

For  one  thing,  the  result  presented  is  for  single-signal  and  single¬ 
frequency.  When  multiple-signal  and/or  multiple-frequency  are  used,  the  per¬ 
formance  should  be  improved  since  more  information  is  gathered  for  analysis. 
Some  initial  studies  have  been  made  [14,21,33,36,44,  48].  However,  how  to 
choose  these  signals  and  frequencies  so  that  the  extra  information  can  be  used 
more  efficiently  and  effectively  and  what  the  limitations  are,  are  still  un¬ 


known  to  us . 
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The  diagnosis  of  nonlinear  circuits  are  barely  touched  [20,31,39,40,42], 
This  important  problem  is  still  at  large. 

The  problem  of  diagnosis  of  analog/digital  hybrid  circuit  is  essentially 
untouched  [32], 

The  fault  diagnosis  techniques  have  other  applications.  Since  it  has  the 
capability  of  locating  the  faulty  devices  from  available  test  points,  it  has 
the  potential  to  be  integrated  with  manufacture  process  to  do  CAM  for  IC 
chips.  For  another  application,  the  single-frequency  single-signal  method  can 
do  on  1 ine  fault  diagnosis.  Therefore,  it  provides  an  important  link  for  the 
design  of  self-repairing  systems. 
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Abstract— A  concept  of  k -node-fault  testability  is  introduced.  A  suffi¬ 
cient  and  almost  necessary  condition  for  testability  as  well  as  the  test 
procedure  is  presented.  This  condition  is  further  evolved  to  a  necessary  and 
almost  sufficient  topological  condition  for  testability.  A  unique  feature  of 
this  condition  is  that  it  depends  only  on  the  graph  of  the  circuit,  not  on  the 
element  values.  Based  on  this  condition,  a  design  of  testability  can  be 
established. 

I.  Introduction 

FAULT  DIAGNOSIS  of  analog  networks  has  become 
an  increasing  active  research  area  recently  [1]— [1 1],  A 
good  survey  on  this  subject  can  be  found  in  [1],  [2], 
Roughly  speaking,  analog  fault  diagnosis  is  the  problem  of 
detecting  the  faulty  analog  networks  and  locating  the 
faulty  elements  or  parameters  in  those  faulty  networks.  In 
this  paper,  we  are  concerned  with  only  the  second  part  of 
the  problem,  namely,  the  fault  location  problem. 

Recently,  two  similar  methods  on  fault  location  of  linear 
networks  which  involve  only  linear  equations  have  been 
proposed  by  Biemacki  and  Bandler  [9],  [10]  and  Skala  et  al. 
[11].  These  methods  are  based  on  checking  consistency  or 
inconsistency  of  certain  linear  equations  which  are  in¬ 
variant  on  faulty  elements.  And  they  have  been  shown  to 
be  equivalent  in  [17],  Since  only  linear  equations  are  in¬ 
volved,  they  are  computationally  appealing.  However,  as 
far  as  testability  condition  is  concerned,  none  is  given 
explicitly  in  [11],  while  only  a  necessary  condition  is  pro¬ 
vided  in  [10].  This  necessary  condition  is  given  in  terms  of 
network  functions  (impedance  function).  Hence  it  is  not 
easy  to  apply,  especially  in  the  environment  of  testability 
design. 

In  Section  II  of  this  paper,  instead  of  port-voltages  in  [9], 
[10]  or  branch- voltages  in  [11],  node- voltages  are  used  as 
the  basis  for  the  derivation  of  the  node-diagnosis  equation. 
In  Section  III,  following  the  procedure  given  in  [11],  a 
necessary  condition  for  testability  is  hence  derived.  The 
major  effort  is  to  show  that  this  necessary  condition  is  also 
almost  sufficient. 

The  main  contribution  of  this  paper  is  given  in  Section 
IV.  It  is  shown  that  the  testability  condition  which  is  based 
on  network  functions,  can  be  reduced  to  a  condition  which 
depends  only  on  the  graph  of  the  circuit  not  the  element 
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values.  The  simplicity  of  the  topological  condition  for 
testability  allows  us  to  study  ( 1 )  the  design  of  the  location 
and  the  number  of  test  points  for  a  given  circuit,  and  (2) 
the  design  of  alternative  circuits  which  are  testable. 

For  the  purpose  of  the  application  to  branch  diagnosis. 
Theorem  5  of  Section  V  provides  a  method  to  determine 
the  faulty  branches  as  well  as  their  deviated  admittance 
values  if  the  node  fault  testability  condition  is  satisfied. 
Examples  are  given  in  Section  VI. 

II.  Node-Diagnosis  Equations 

Let  N  be  a  6-branch,  ( n  +  l)-node.  ( m  +  1  j-terminal. 
linear,  time-invariant,  lumped  nonreciprocal  network  (Fig. 
1 ).  Out  of  the  ( n  +  1 )  nodes.  ( m  +  1 )  are  accessible  termi¬ 
nals  for  excitation  and  measurement.  Label  the  nodes  in 
the  following  way.  Let  one  of  the  accessible  nodes  be  the 
reference  node.  Then  label  the  rest  of  the  accessible  nodes 
(i  =  1,-  •  -  ,m)  ahead  of  inaccessible  nodes  (i  =  m  +  1,-  •  -,n). 
The  following  notations  are  used  for  V: 
vm  terminal  (accessible  node)  voltage  vector. 
im  terminal  current  vector, 
vb  branch  voltage  vector, 
ib  branch  current  vector. 

vn  node  (including  accessible  node)  voltage  vector, 

Yb  branch-admittance  matrix, 

Y„  node-admittance  matrix. 

Note  that  Yb,  Y„  need  not  be  symmetrical  and 

-  Ybvb  ( 1 ) 

Yn-AYbAT  (2) 

where  A  is  the  incidence  matrix  of  the  graph  and  Y„  can  be 
obtained  from  the  network  directly. 

Now  consider  that  the  network  N  is  perturbed  to  (.V  + 
AV)  in  the  way  that  Yb  is  perturbed  to  Yb  +  AT*,  and  the 
graph  remains  the  same.  In  the  perturbed  network,  we 
denote  the  corresponding  voltages,  currents,  and  admit¬ 
tance  by  vm  +  Aom,  im  +  Aim,  vb  +  Ac*.  ih  +  A<„.  v„  +  Ac,, 
and  Y„  +  AT,,  respectively. 

Given  Yb  and  the  graph,  the  purpose  of  the  fault  diagno¬ 
sis  is  to  estimate  AY*  from  the  information  i„,  lvm,  and 
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Fig.  2. 


A/m.  Note  that  vm  can  be  calculated  from  im  and  N. 
Without  loss  generality,  we  may  assume  that 

Aim  =  0 


i.e.,  apply  the  same  im  to  the  original  network  N  and  the 
perturbed  network  N  +  AN. 

Denote  Z„  and  partition  it  in  the  following  way: 


=  v-i 


z_  =  y 


(3) 


where  Zmn  e  Cm*\  Z,n  €  Clxn,  and  /  =  n  -  m. 

Under  these  conditions,  the  node- voltage  equations  for 
N  and  N  +  AN  are  given  by 


Y.v. 


(y.  +  AnMo.  +  Ao.)- 


'« 

L0 

‘m 

0 


(4) 

(5) 


A  subtraction  of  (5)  by  (4)  yields 

y(IAqI--Ay>(1  +  A«>,1)±y,,  (6) 

where  the  RHS  is  denoted  by  jn.  Using  (3),  (6)  becomes 


It  is  easy  to  see  from  (6)  that  if  node  /  is  fault-free,  then 
the  /th  component  of  jn  is  zero.  However,  the  converse  may 
not  be  true.  It  is  possible  tha.  node  /  is  at  fault  but  the 
effect  of  faulty  branches  associated  with  node  i  may  cancel 
each  other  to  give  zero  to  the  /th  component  of  j„.  Thus 
possibility  is  very  very  small.  Henceforth,  we  will  make  the 
following  assumption. 

Assumption  1:  If  node  /  is  faulty  then  the  /  th  component 
of  j„  is  nonzero.  Under  this  assumption,  node  /  is  faulty  if 
and  only  if  the  /  th  component  of  j„  is  nonzero. 

Next,  we  want  to  determine  those  nonzero  components 
of  /„  from  the  node-diagnosis  equation  (8). 

Ill.  /c-Node-Fault  Testability 

We  start  with  a  definition  of  testability. 

Definition  2:  A  network  N  is  said  to  be  k-node  fault 
testable  if  when  N  is  perturbed  to  (jV  +  AN),  one  will  be 
able  to  determine,  by  choosing  appropriate  testing  signals 
im,  from  the  measurements  on  accessible  nodes: 

(a)  whether  or  not  N  has  no  more  than  k  node  faults, 

(b)  if  affirmative,  the  faulty  nodes  can  be  uniquely 
located. 

Recall  that 

ZmnJn  =  *Vm  (8) 

and  (with  Assumption  1)  that  node  /  is  faulty  if  and  only  if 
the  i  th  component  of  y„  is  nonzero.  Let 

Xk  =  {x  e  C" |  number  of  nonzero  components 

of  x  not  more  than  k) 


Znjn  - 

(7) 

^ mnJn  *  * 

(8) 

The  equations  (6)  and  (8)  are  called  the  node-diagnosis 
equations. 

Since  Zmn  and  Avm  are  known,  we  can  first  solve  for  jn 
from  (8)  and  then  estimate  AT„  from  (6).  However,  for 
practical  diagnosis  problems,  n  is  much  larger  than  m,  and, 
therefore,  j„  is  not  unique.  Furthermore,  At>„  is  unknown, 
which  creates  difficulties  for  solving  (6).  However,  if  we 
take  advantage  of  a  structural  property  of  the  network, 
which  will  be  discussed  later,  these  difficulties  can  be 
circumvented.  This  will  be  discussed  next. 

A  crucial  property  of  A  Y„  is  that  it  has  the  same  struc¬ 
ture  as  y„,  i.e.,  if  (/,  f)th  element  of  Y„  is  zero,  then  the 
(/,  y  )th  element  of  A  Y„  is  also  zero.  The  problem  of  locat¬ 
ing  the  faulty  branches  becomes  the  problem  of  locating 
the  positions  of  nonzero  entries  of  AYn. 

Definition  l:  Branch  k,  k  - 1,2,-  •  -,6,  is  said  to  be 
fault-free  if  the  k  th  row  of  A  Yb  is  zero.  Node  /,  i  *  1 . 2,  •  •  • ,  n, 
is  said  to  be  fault-free  if  all  branches  incident  with  node  i 
are  fault-free.  Otherwise,  they  are  faulty. 

Nott  that  the  number  of  node  faults  may  be  smaller  than 
the  number  of  branch  faults.  This  is  illustrated  in  Fig.  2. 
There  are  three  branch  faults  (a,b,c)  but  only  two  node 
faults  (1,2). 


then  the  perturbed  network  N  +  AN  has  the  number  of 
multiple  node-faults  less  than  or  equal  to  k  if  and  only  if 
j„  €  Xk.  Let  zt,  i  =  1,2.-  •  -,n  be  the  columns  of  Zmn.  We 
call  the  condition 

r*nk[vV,vJ"*  +  l, 

VI  <  <|  <  /2  <  •  •  •  <  i**i  ^  n  (9) 

the  k-node- fault  testability  condition. 

Let  the  test  matrices  be  the  combinations  of  k  columns  of 
Zmn  and  Acm,  i.e., 

r<m  0°) 

i  =  1,2, -  -  -,s,  where  s  ”  C(n,  k).  Then  we  have  the  follow¬ 
ing  two  theorems. 

Theorem  l:  Let  the  k-node-fault  testability  condition  (9) 
be  satisfied. 

(a)  If 

rank7;-A;  +  l,  i  —  1 , 2. -  -,s  (11) 

then  N  has  more  than  A:-node  faults.  Conversely,  if  N  has 
more  than  /c-node  faults,  then  (1 1)  is  satisfied  for  almost  all 

(b)  If  (1 1)  is  not  satisfied,  i.e., 

rankT,  <k  +  \  (12) 

for  some  then  N  has  no  more  than  A:- node  faults  for 
almost  all  Aom.  Furthermore,  all  faulty  nodes  can  be 
uniquely  isolated  by  (13). 
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Proof:  (a)  Conditions  (9)  and  (11)  imply  that  Aum 
cannot  be  represented  by  any  k  columns  of  Zm„,  i.e.,  (8) 
has  no  solution  in  Xk.  Therefore,  N  has  more  than  fc-node 
faults. 

Conversely,  if  Af  has  more  than  fc-node  faults,  then 
jn  €  Xk,  i.e.,  for  almost  all  Aum,  it  cannot  be  spanned  by  k 
columns  of  ZmH.  Therefore,  Condition  (11)  is  satisfied  for 
almost  all  Aom. 

(b)  Let  the  test  in  (a)  be  failed.  Therefore,  there  are  at 
most  k- node  faults,  and  (12)  is  satisfied.  It  follows  from  (9) 
that 

V ' -  >z«  J  *  rank  [r.v  V ' -  "  k 

for  some  (z,t,  z,v •  •  -,zik).  Consequently,  (8)  has  a  solution 
in  Xk.  Furthermore,  in  view  of  Theorem  A1  in  the  Appen¬ 
dix,  the  solution  is  unique  for  almost  all  Aum. 

Finally,  the  faulty  nodes  can  be  located  as  follows.  From 

(8) ,  delete  the  columns  of  Zmn  and  components  of  jn  which 
are  not  associated  with  (rj|(  zh,- ■  As  a  result,  we 
have 

Zmkjk  -  Aom 

where  Z„*-[ andy* -col(y(|, •• -,yjt).  Then 
jk-{ZTmkZmkY'zTmk  Auw.  (13) 

The  nonzero  components  of  jk  indicate  the  faulty  nodes. 

Q.E.D. 

Theorem  2:  N  is  k-nodc-fault  testable  if  the  A>node-fault 
testability  condition  (9)  is  satisfied.  Conversely,  if  a  single 
test-vector  is  used  then  condition  (9)  is  also  necessary. 

Proof:  Let  (9)  be  satisfied.  Theorem  1  implies  that  N 
is  fc-node-fault  testable  for  almost  all  Avm.  Consequently, 
there  exists  an  im  so  that  N  is  fc-node-fault  testable.  Con¬ 
versely,  if  a  single  test-vector  is  used,  then  applying  Theo¬ 
rem  A1  to  (8)  one  can  conclude  that  condition  (9)  is  also 
necessary  for  k-node-fault  testability.  Q.E.D. 

Remark  3.1:  The  idea  of  setting  the  fault-diagnosis 
equation  by  an  algebraic  linear  equation,  and  locating  the 
fault  elements  by  a  matrix  rank  test  were  presented  by 
Biernacki  and  Bandler  (9]  and  Sakla,  El-Masry  and  Trick 
(11).  Some  conditions  on  the  matrix  rank  test  for  multiple- 
fault  isolation  was  First  presented  by  Biernacki  and  Bandler 

[9] ,  [10];  and  also  by  Trick  (24)  and  Trick  and  Li  [25].  The 
necessary  and  almost  sufficient  condition  (Theorem  1)  is 
first  given  by  Huang,  Lin,  and  Liu  [18],  and  also  appeared 
in  Trick  and  Li  [25]. 

Remark  3.2:  Biernacki  and  Bandler  applied  the  matrix 
rank  test  method  to  the  branch-diagnosis  equation 

2mhjb  -  Aom 


and  Trick  to  a  sensitivity  equation,  which  can  be  shown  to 
be  equivalent  to  the  branch-diagnosis  equation  [26].  The 
difficulty  of  using  the  branch-diagnosis  equation  is,  as 
pointed  out  by  Trick  [25],  that  multiple  branch  faults 
usually  cannot  be  uniquely  located.  On  the  other  hand, 
multiple  node  fault  can  be  uniquely  located.  Therefore,  we 
used  the  node-diagnosis  equation  instead  of  the  branch- 
diagnosis  equation. 


Remark  3.3:  The  number  k  of  multiple  faults  can  be 
tested  by  (1 1),  not  pre-assumed  as  the  case  in  [9|— [  11],  [24]. 
Since  k  is  usually  not  known,  this  feature  has  practical 
importance. 


IV.  Testability  Design 

Partition  the  matrices  Yn  and  Z„  in  the  following  way: 


I  =  YnZn  =  [YamYj 


■*lr% 


where  m  +  l  —  n.  Let  the  row-vectors  of  Yn,  be  denoted  by 
i«  1,2, •••,#!,  i.e., 

>i 


‘ nl  ' 


h 


LY*  J 

An  equivalent  testability  condition  can  be  stated  in  terms 
of  row-vectors  of  Yn,. 

Theorem  3:  The  k-node-fault  testability  condition  (9)  is 
satisfied  if  and  only  if 


rank 


Yj, 

Yh 


=  /,  VI  ^j.<j2---<j,<n  (14) 


where  t  *  n  -  k  -  1. 

The  proof  of  Theorem  3  is  given  in  the  Appendix  B. 

Since  usually  n  -  k  —  1  »  k  +  1  and  m.  condition  (9) 
is  easier  to  be  tested  than  (14).  However,  condition  (14)  is 
useful  for  the  design  of  testability  if  condition  (9)  fails.  This 
is  because  each  element  of  Yn,  has  a  direct  interpretation  in 
the  circuit.  For  example,  since  Yn,  concerns  only  those 
branches  connected  with  inaccessible  nodes,  the  branches 
among  accessible  nodes  have  no  effect  on  the  testability.  We 
will  now  present  a  criterion  which  depends  only  on  the 
graph  of  the  circuit,  and  thereby  provides  a  means  for  the 
design  for  testability. 

We  will  first  consider  the  network  N  with  no  dependent 
sources.  From  the  graph  G  of  the  network  N.  we  construct 
a  testing  graph  G,.  First,  delete  all  branches  which  are 
incident  between  two  accessible  nodes.  Then,  connect  all 
accessible  nodes,  except  the  reference  node,  to  a  new  node 
t. 

Theorem  4:  Let  <7  be  connected  and  k  <  m.  The  follow¬ 
ing  three  statements  are  equivalent. 

(a)  The  k-node-fault  testability  condition  ( 14)  is  satisfied 
for  almost  all  values  of  branch  admittances. 

(b)  Any  inaccessible  node  and  the  node  t  have  at  least 
(k  +  1)  local  connectivity1  in  G,. 

(c)  There  are  at  least  (k  + 1)  independent  paths'  from 
any  inaccessible  node  to  the  node  t  in  Gr 


'The  local  connectivity  of  two  nonadjacent  nodes  is  the  minimum 
number  of  nodes  separating  them,  and  two  paths  are  independent  if  they 
do  not  have  any  common  node,  except  at  the  terminal  nodes  [16]. 
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Before  the  theorem  is  shown,  let  us  prove  two  lemmas. 
First,  consider  any  (n  -  k  -  l)x  /  submatrix  Y  of  Ynl-  Let 
these  (rt-  k  -  l)  rows  correspond  to  the  set  M2  of  m2 
accessible  nodes  and  the  set  L,  of  /,  inaccessible  nodes 
where  m2  +  /,  »  n  -  k  - 1.  Then  the  node  equation  V„u„  =  in 
can  be  rearranged  to  the  form 


Ym  ,  Ym  . 

”2*1  "•5*2  ,  , 

r-  Y  y  •  o«) 

Hence  the  set  M,  of  m,  accessible  nodes  and  the  set  L2  of 
l2  accessible  nodes  are  those  not  associated  with  the  rows 
of  Y. 

Note  that 

/c  +  l-m,  -/2  (17) 

and  if  m,  +  m2  -  m  >  k,  then 

m2  ^  /2.  ( 18) 

The  purpose  of  the  following  two  lemmas  is  the  con¬ 
struction  of  a  subgraph  G0  of  G,,  which  will  be  useful  in 
the  proof  of  Theorem  4. 

Lemma  l:  If  (b)  of  Theorem  4  is  satisfied,  then  there 
are  /2  disjoint  paths  connecting  the  set  L2  and  M2  in  G,  and 
no  two  such  paths  have  any  common  node. 

Proof:  From  hypothesis  (b),  the  local  connectivity  in 
G,  between  the  node  t  and  any  inaccessible  node  is  at  least 
k  + 1.  Now,  remove  all  branches  from  Af,  to  the  node  t. 
Then  the  local  connectivity  between  any  inaccessible  node 
in  L2  to  the  node  t  is  at  least  /c  +  l-m,  which  is  l2  by  (17). 
Finally,  connect  all  the  nodes  in  L2  to  a  new  common  node 
q.  The  local  connectivity  between  q  and  t  is  exactly  /2. 
Therefore,  by  Menger’s  Theorem,  there  exist  l2  indepen¬ 
dent  paths  connecting  q  and  t,  as  shown  in  Fig.  3.  Conse¬ 
quently  the  lemma  is  proved.  Q.E.D. 

Lemma  2:  Let  (b)  be  satisfied.  In  <?,,  there  exists  a 
loopless  connected  subgraph  G0  which  contains  the  node  r, 
all  inaccessible  node  and  all  the  disjoint  paths  in  Lemma  1. 

Proof:  First,  take  all  the  disjoint  paths  given  in  Lemma 
1  and  extend  the  paths  to  the  node  t.  This  subgraph 
contains  all  the  nodes  in  L2  and  the  node  t.  For  every  node 
in  L,,  by  (b),  there  exists  a  path  from  it  to  the  node  t.  Now, 
G0  can  be  constructed  by  repeating  the  following  process. 


Pick  a  node  in  £,,  follow  a  path  to  node  t  and  stop  when  it 
first  reaches  a  node  of  existing  subgraph.  Then  add  all  the 
new  branches  to  the  existing  graph  and  repeat  the  process 
for  the  next  node  in  L,.  A  typical  G0  is  shown  in  Fig.  4. 

Q.E.D. 

Proof  of  Theorem  4: 

(b)  =■»  (a)  Consider  any  (n  -  k  -  l)x  /  submatrix  Y  of 
n,-  We  want  to  show  that  Y  has  full  column-rank  for 
almost  all  branch  admittances.  In  the  first  step,  we  want  to 
show  that  it  is  true  for  a  particular  set  of  branch  admit¬ 
tances.  We  choose  a  specific  set  of  branch  admittances  in 
the  following  way:  first,  set  all  branch  admittances  to  zero 
for  those  branches  not  in  G0;  and  then,  set  other  branch 
admittances  to  positive  values.  Therefore,  this  network  is 
connected,  passive,  and  resistive.  Let  it  be  denoted  by  S0. 
We  want  to  show  that,  for  the  network  N0,  Y  has  full 
column-rank. 

With  S0,  we  ground  all  accessible  node  and  excite  all 
nodes  in  L,  and  L2  by  voltage  sources  v,t  and  r,,,  as  shown 
in  Fig.  5.  From  (15),  we  have 


If  Y  is  not  full  column-rank,  then  there  exists  nonzero 
(o/(,  vtj)  such  that  (imj,  i,t )  -  0.  We  want  to  show  that  this 
is  not  possible  for  N0.  Since  jV0  is  a  connected  passive 
network,  and  (v^v^)  are  the  only  sources,  it  concludes 
that  (i,)t  i/f)  •  0  if  (v,r  v,2)  +  0.  If  i/(  •  0,  we  already  have  a 
contradiction.  It  remains  to  consider  the  case  when  -  0 
and  i,2  •  0.  We  want  to  show  that  in  this  case  im  *  0.  Since 
I,,  -  0  and  jV0  is  passive  resistive  circuit,  we  may  delete  all 
branches  except  the  paths  given  in  Lemma  1,  as  far  as 
KCL  equation  is  concerned.  Since  all  these  paths  are 
disjoint,  »  0  implies  that  imj  *  0,  which  again  is  a  con¬ 
tradiction.  Therefore,  ( »,,,  v,2)  -  0  if  and  only  if  (imj,  i,t )  - 
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0.  It  concludes  from  (19)  that  Y  has  full  column-rank  for 
*o- 

Now  in  the  second  and  final  step,  we  want  to  show  that 
Y  has  full  column-rank  for  almost  all  branch  admittances. 
Note  that  Y  has  full  column-rank  if  and  only  if  det(yry) 
»  0.  A  key  fact  is  that  det(K7Y)  is  a  polynomial  of  the 
branch  admittances,  and  it  is  not  zero  for  a  particular  set 
of  admittances,  namely  N0.  By  a  well-known  theorem  [20. 
21],  it  can  be  concluded  that  det(yry)  *  0  for  almost  all 
branch  admittances.  This  proves  (b)  (a). 

(b)  «•  (c):  Directly  from  the  Menger’s  Theorem  [16]. 

(a)  **  (b):  We  will  prove  it  by  contradiction.  Let  (b)  be 
false.  Then  there  are  k  nodes  which  separate  G,  into  two 
parts:  Part  A  contains  node  t  and  Part  B  contains  at  least 
one  inaccessible  node.  Since  all  accessible  nodes  are  adjac¬ 
ent  to  r,  they  cannot  belong  to  Part  B.  Let  Part  B  contain 
r(r  >  1)  inaccessible  nodes.  Note  that  any  element  of  Yn 
is  zero  if  and  only  if  nodes  i  and  j  are  nonadjacent.  Since 
Part  A  and  Part  B  are  separable,  the  element  of  Yn  which 
associate  with  any  nodes  of  Part  A  (except  t )  and  any  node 
of  Part  B  is  zero.  Since  there  are  r  inaccessible  nodes  in 
Part  B,  there  are  a  zero-submatrix  Y*  of  Ym  whose  di- 
mension  is  r  x(n  -  r  -  k).  Therefore,  from  [15],  any  sub¬ 
matrix  of  (n  -  k  -  1)X  /  of  Yn  which  contains  y*  cannot 
have  full  column-rank.  Henceforth,  condition  (14)  cannot 
be  satisfied  for  such  submatrix.  Q.E.D. 

Let  us  now  extend  Theorem  4  to  include  active  elements. 
First,  represent  all  active  elements  by  a  set  of  dependent 
current  sources  each  of  which  is  incident  to  the  reference 
node  on  one  side  and  to  any  other  node  on  the  other  side, 
and  controlled  by  node  voltages.  Note  that  a  dependent 
current  source  has  entry  in  the  submatrix  Yn/  if  and  only  if 
it  is  controlled  by  the  inaccessible  node  voltages.  In  view  of 
the  above,  we  will  construct  a  test  graph  G  from  the  graph 
G  by  the  following  steps.  (1)  Delete  all  dependent  sources 
which  are  controlled  by  accessible  nodes.  (2)  If  a  depen¬ 
dent  source  is  incident  with  an  accessible  node  /  but 
controlled  by  an  inaccessible  node  voltage  vJt  then  delete 
this  dependent  source  and  add  a  new  branch  incident  with 
nodes  i  and  j.  Note  that  G  and  G  have  the  same  nonzero 
entries  in  Y„t.  It  remains  to  consider  these  dependent 
sources  which  are  incident  with  inaccessible  nodes  and 
controlled  by  inaccessible  node  voltages.  A  complete  de¬ 
scription  of  this  would  be  too  involved  for  this  paper.  We 
will  give  only  necessary  and  sufficient  conditions.  Con¬ 
struct  Gs  from  G  by  deleting  all  dependent  sources  and  GN 
from  (/  by  the  following  step:  if  there  is  a  dependent  source 
incident  with  node  i  and  controlled  by  node  voltage  c;, 
then  delete  the  dependent  source  and  add  a  new  branch 
incident  with  nodes  i  and  j. 

Corollary  4.1:  If  GN  does  not  satisfy  (b)  or  (c)  of  Theo¬ 
rem  4,  then  N  is  not  fc-node-fault  testable.  If  Gs  satisfies  (b) 
or  (c),  then  N  is  k-node-fault  testable  for  almost  all  branch 
admittances. 

The  proof  is  straightforward  and  is  omitted. 

Remark  4.1:  The  necessary  topological  condition  of 
Theorem  4  is  first  given  by  [18],  Togawa  and  Matsumoto 
[28]  have  given  an  independent  proof  of  the  sufficient 


condition.  Starzyk  and  Bandler  have  studied  the  case  when 
the  topological  condition  is  partially  satisfied  [27], 

V.  Branch-Fault  Diagnosis 

In  this  section,  we  will  show  that  branch  faults  can  be 
isolated  if  multiple  test-signal  vectors  are  used. 

Theorem  5:  Let  N  and  its  adjoint  circuit  N  be  k-node- 
fault  testable.  If  N  +  A N  and  N  +  A N  have  no  more  th<ui  k 
faulty  nodes,  then  the  faulty  branches  can  be  located  and 
their  admittances  can  be  determined  by  a  set  of  m  indepen¬ 
dent  vectors  im. 

Proof:  Since  N  is  ^-node-fault  testable  and  N  +  AjV 
has  no  more  than  k  faulty  nodes,  by  Theorem  1,  jn  can  be 
uniquely  determined  for  almost  every  im.  From  (7)  and  (4), 


Ac*  =  ZJ„  (7) 


we  can  calculate  un  +  At>„  by 

d„  +  A ub  =  Zb(  +,„).  (20) 

Hence,  A  Yn  can  be  determined  by  (6) 

-yn  =  AT„(un  +  AuJ  (6) 

using  multiple  independent  test-signal-vectors  im  in  the 

following  way.  Take  independent  im’s  so  that 

*m2>‘  ‘  ’  (21) 

is  nonsingular.  Correspondingly,  we  can  measure 

A^  =  [Aum|,ADm;.---,AumJ  (22) 

Then  it  can  be  shown  that  when  im  is  applied  to  the 

perturbed  adjoint  circuit 

(t  +  AfJ-(i;  +  Arn)r  (23) 

the  perturbed  voltage  can  be  calculated  by 

At3„  =  (A  VJZ'fL-  (24) 

With  this  Ai3m  and  im,  we  can  apply  the  node-fault  diagno¬ 
sis  procedure  to  the  adjoint  circuit.  Since  by  hypothesis  the 
adjoint  circuit  is  k-node-fault  testable  and  has  no  more 
than  k  faulty  nodes,  we  can  identify  the  nonzero  rows  of 
A Y„;  i.e.,  k’  (/c'<  k)  nonzero  columns  A Yn  by  (23).  Once 
the  set  of  nonzero  columns  of  Ay„  is  identified,  we  go  back 
to  (6).  Let  5  be  a  column-selector  matrix  [22]  such  that 
[AJ^JS  is  an  nXk’  matrix  with  k'^k  and  that  [ATJ5 
retains  all  nonzero  columns  of  AT,.  Then  (6)  becomes 

-  h  -  [ArJk  +  AuJ  -  [A Y„]SST[v„  +  AuJ. 

Applying  the  m  independent  im’s  in  (21),  we  have  corre¬ 
spondingly  the  matrix  equation 

-/.-[AKjSSq^  +  AKj.  (25) 

It  remains  to  show  that  Sr[!^  +  AFJ  has  full  row-rank. 
From  (5)  we  have 

F,  +  AF,-(Z„  +  AZ,)  «(Z„m  +  AZ„J/m. 
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So, 

ST[Vn  +  WH)-ST(Znn  +  dZHm)Im. 

Since  the  adjoint  network  N  is  ^-node-fault  testable, 
ST(Znm  +  AZ„m)  has  full  row-rank  foT  almost  all  branch 
admittance.  Consequently,  ST[Vn  +  AFJ  has  full  row-rank 
for  almost  all  branch  admittance.  Therefore,  the  nonzero 
columns  of  Ay„  can  be  evaluated  by 

lAyJS--J,(K  +  AKjrS) 

•(S^  +  ArJlK  +  ArJ^)"'.  (26) 


test  points.  By  the  method  described  in  Corollary  4.1,  the 
test  graph  G,  and  G„  are  in  Fig.  8.  Note  from  Theorem  4 
that  the  ground  branches  have  no  effect  on  the  testability, 
and  hence  they  are  deleted.  For  this  circuit  (if  we  don’t 
consider  the  self-loops  in  node  3  and  7),  the  Ga  and  G„  are 
the  same.  We  can  easily  conclude  by  condition  (c)  of 
Theorem  4  that  the  circuit  is  one-node  fault  testable.  Now, 
suppose  that  we  want  it  to  be  2-node-fault  testable.  In  the 
Gs,  node  5  and  7  are  incident  with  only  two  branches 
respectively,  and  each  of  them  cannot  have  three  indepen¬ 
dent  paths  to  other  nodes.  Therefore,  nodes  5  and  7  have 
to  be  test  points.  Under  this  choice,  every  inaccessible  node 
has  three  independent  paths  to  the  accessible  nodes.  Hence 
by  Theorem  4  and  Corollary  4.1,  the  circuit  is  2-node  fault 
testable  for  almost  all  values  of  the  branch  admittances. 

In  the  previous  example  we  described  how  to  choose  the 
test  points  for  a  given  circuit. 

Example  2:  (Branch  Fault  Isolation ) 


From  this,  the  faulty  branches  can  be  located  and  their 
admittances  can  be  determined.  Hence,  Theorem  3  is 
proved.  Q.RD. 

Remark  5.1:  Note  that  N  and  A  N  may  not  have  the 
same  graph.  Therefore,  a  short  between  two  open  nodes 
can  be  detected,  although  in  this  case  an  extra  branch 
between  the  two  open  nodes  is  inserted  in  the  faulty 
network. 

Remark  5.2:  If  Y„  is  symmetrical,  then  the  conditions 
on  the  adjoint  circuit  are  automatically  satisfied,  and  only 
k  independent  test-signal-vectors  im  are  sufficient. 


Consider  the  network  N  as  shown  in  Fig.  9.  Nodes  1,2,3 
are  accessible  nodes.  Let  the  nominal  values  of  the  circuit 
of  Fig.  9  be  j^  —  1,  i  - 1,2,- • -,6,8  and  y7«  2.  We  can 
calculate  Z„„  -  [Z„  Z2,  Z3,  Z4J: 


0.593220  0.084746  0.067797  0.186441 
0.084746  0.440678  0.152542  0.169491 
0.067797  0.152542  0.322034  0.135593 


Now  assume  branches  3  and  8  are  perturbed  such  that 
y3  +  Ay3  -  2,  and  y%  +  Ay8  -  2  and  other  branches  remain 
the  same.  Choosing 


VI.  Examples 

Example  1:  (Testability  Design) 

For  a  given  circuit,  the  purpose  of  testability  design  is  to 
decide  the  number  and  the  location  of  test  points  such  that 
the  circuit  is  testable.  Let’s  consider  a  feedback  amplifier 
(Fig.  6).  First  we  get  .the  AC  small  signal  equivalent  circuit 
(Fig.  7).  It  is  reasonable  to  choose  node  1  and  8  to  be  the 


1  0  0 
0  1  0 
0  0  1 


we  can  measure  the  voltages  on  accessible  nodes: 


K.  +  AK. 


0.5625  0.0625  0.0625 
0.0625  0.4196  0.1339 
0.0625  0.1339  0.2767 


T 
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and  evaluate 


AT„ 


-0.030720  -  0.0222458  -  0.0052966 

-0.0222458  -  0.0210351  -0.0186138 

-0.0052966  -  0.0186138  -  0.0452482 


From  Theorem  4,  this  circuit  is  2-node-fault  testable.  By 
Theorem  1,  check  the  singularities  of  the  test  matrices 
[Z(,  Zj,  At>„]  for  all  1  <  /  <  j  <  4.  It  is  found  that  only 
det[Z3,  Z4,  Au„]  —  0.  Therefore,  we  can  conclude  that  node 
3  and  4  are  faulty  and  calculate  the  /,  by  (13) 

0  0  0 

0  0  0 

0.0625  -  0.008929  -  0.151789 

L  -  0. 1 875  -  0. 1 1607 1  -  0.026786  J 

It  is  seen  that  the  first  two  rows  of  /,  are  zero  and  hence 
the  first  two  rows  of  AT,  are  zero.  Since  AT,  is  symmetrical, 
the  first  two  columns  of  AT,  are  zero.  Now,  Sr[T,  +  AT,] 
can  be  calculated  from  (20)  and  deleting  the  first  two  rows, 
we  obtain 


ST[K  +  AT,] 

Finally,  we  can  calculate  the  AT,  by  (26): 


It 


0625 

125 


0.1339 

0.125 


0.2768] 
0.125  j' 


0 

0 


26J 


If  the  testability  condition  is  not  satisfied,  then  addi¬ 
tional  test  points  are  needed.  The  choice  of  this  additional 
test  points  can  be  greatly  simplified  by  the  usage  of  Theo¬ 
rem  4.  The  most  important  feature  of  this  theorem  is  that  it 
depends  only  on  the  graph  but  not  the  element  values  of 
the  network.  It  is  shown  that  this  graphical  condition  is 
necessary  and  almost  sufficient  for  the  testability  condi¬ 
tion. 

VI.  Appendix 

A.  The  Uniqueness  of  Solution  of  Equation  Bx  =  p 
Recall  that  (8)  has  the  form 

Bx  —  p  (Al) 

where  B  e  CmX "  with  n>  m,  x6C"  and  p  €  Cm.  For 
practical  problems,  n  >  m,  and,  therefore,  the  solution  of 
(Al)  in  general  is  not  unique.  However,  if  we  restrict  the 
number  of  nonzero  components  of  x  to  be  less  than  m, 
then  the  solution  could  be  made  unique.  Hence,  we  want  to 
find  the  solution  of  (Al)  in  Xk. 

Note  that  a  solution  jc  e  Xk  of  (8)  exists  if  and  only  if  p 
is  a  linear  combination  of  some  k  columns  of  B.  Hence¬ 
forth,  let  the  subspaces  spanned  by  the  combinations  of  k 
column  vectors  of  B  be  denoted  by  /?,,  i » 1,2,-  •  -,r,  where 
r**C(n.k).  Let  the  intersections  of  combinations  of  two 


+  1  -1 

.-1  +2. 

R,’s  be  denoted  by  Qr  j  =  1,2,-  •  • 
Furthermore,  let 

,q,  where  q=*C(r, 2). 

_  The  diagnosis  is  two  branch  faults: 

r 

■  Ay,  - 1 

Ay3-1. 

8-  U  R, 

i-i 

(A2) 

<7 

a  -  u  Qr 

(A3) 

VII.  Conclusion 

'  ,-l  ' 

The  purpose  of  node  diagnosis  is  to  determine  those 
P  nodes  which  are  faulty.  A  node  is  faulty  if  any  branch 
incident  with  it  is  faulty.  Therefore,  if  faulty  nodes  are 
correctly  located,  all  the  faulty  branches  must  be  incident 
with  these  faulty  nodes.  A  network  is  k-node-fault  testable 
if  from  some  measurements,  (1)  one  can  determine  whether 
or  not  there  are  k  or  less  faulty  nodes;  and  in  addition,  (2) 
_  if  affirmative,  the  faulty  nodes  can  be  uniquely  located. 
Note  that  in  the  definition  of  testability  it  includes  the 
determination  of  the  number  k. 

A  necessary  and  almost  sufficient  condition  of  k-node- 
fault  testability  is  given  in  Theorem  2.  It  has  three  features: 

1)  The  minimum  value  of  test  points  m  for  which  the 
k-node-fault  testability  condition  (9)  to  be  satisfied  is  m  — 
k  + 1.  Therefore,  it  is  possible  to  diagnose  k-node  fault 
with  k  +  1  test  points,  regardless  of  number  of  branches 
and  number  of  nodes. 

2)  The  matrices  to  be  tested  in  (9)  have  size  only  m  x (fc 
+  I),  regardless  of  how  large  the  circuit  is. 

^  3)  The  testability  condition  depends  on  the  nominal 

circuit  and  can  be  tested  before  a  fault  occurs. 

S*  If  the  testability  condition  is  satisfied,  procedures  to 
locate  the  faulty  nodes  and  faulty  branches  are  given  in 
Theorem  1  and  Theorem  5,  respectively. 


Evidently,  8r  c  8. 

Lemma  1:  Equation  (Al)  has  a  solution  x  e  Xk  if  and 
only  if  p  e  Q,  and  the  solution  is  unique  in  Xk  if  and  only  if 
p  e  8  -  8,. 

Proof:  A  solution  x  €  Xk  of  (Al)  exists  if  p  e  R,  for 
some  i,  i.e.,  pe8.  Furthermore,  (Al)  has  two  distinct 
solutions  in  Xk  if  p  €  Rt  and  p  e  Rj  with  i  *  j,  i.e.,  p  e  8,. 
Therefore,  (Al)  has  a  unique  solution  in  Xk  if  p  e  8  -  Sr. 


Let  6,,  b2,  -  •  -  ,bn  be  the  column  vectors  of  B. 
Lemma  2: 

Q.E.D. 

dim  R,  -  k. 

i  -1,2, 

(A4) 

and 

dim£y  <  k. 

j  « 1,2,*  ■  •  ,q 

(A5) 

if  and  only  if 

rank[b(i,b(j,--  -k  +  1, 

VI  <i,  <i2  <  •  •  •  <ik*\  <  «  (A6) 


i.e.,  every  combination  of  (k  +  1)  columns  of  B  is  linearly 
independent. 
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This  lemma  has  an  important  implication  as  follows.  It 
can  be  shown  that 

r  r 

2-8,-  U  Ri  -  U  (/?, n  /?,)  . 

i  - 1  j  - 1 

In  view  of  (A5),  the  measure  of  the  second  term  in  the 
bracket  is  zero  with  respect  to  any  ^-dimensional  measure. 
Consequently,  the  measure  of  2,  is  zero.  In  other  words, 
for  any  bounded  distribution  of  p,  the  conditional  proba¬ 
bility  P(p  e  Q  -  8,|/>  e  Q)  —  1.  The  above  discussion  can 
be  summarized  together  with  Lemma  1  in  the  following. 

Theorem  Al:  Let  p  e  2.  Then  (Al)  has  a  unique  solu¬ 
tion  in  Xk  for  almost  all  p  s  2  if  and  only  if  condition  (A6) 
is  satisfied 

Since  the  existence  of  the  solution  is  guaranteed  for  fault 
diagnosis  problems,  condition  (A6)  becomes  the  only  con¬ 
dition  needed  for  the  uniqueness  of  solutions  in  Xk. 

Proof  of  Lemma  2:  Clearly  (A6)  implies  (A4).  It  also 
implies  that  Rt  *  Rj  if  i  —  j.  Therefore, 

dim^ni?;)  <  dim  R,  =  k 

which  is  (A5). 

Conversely,  let  (A4)  be  satisfied  but  (A6)  be  not  satis¬ 
fied;  w.l.g.,  let 


Let  rank  ?,,  =  k.  Without  loss  of  generality,  let  the  first 
k  rows  of  Pn  be  linearly  independent.  If  not,  apply  simul¬ 
taneously  row-operations  on  [/*, ,  Pn]  and  column  opera¬ 
tions  on 


Ql2 

Qn 


then  the  first  kxk  principal  submatrix  of  P  is  nonsingular. 
Then  the  last  (n  -  k)x(n  -  k)  principal  submatrix  of  Q  is 
nonsingular.  Consequently,  rank  Qn  ~  l —  n-m. 

From  Lemma  Bl,  the  following  is  self-evident. 

Lemma  B2:  Let  Z„,  Yn  s  C"x"  and  YnZn  —  I.  Let 


YH-[Ynm  yj. 


where  Zm„  e  CmX",  Ynm  e  C"Xm  and  m  +  / «  n.  Let  r  «  m. 
Then  every  mXr  submatrix  of  Zmn  has  rank  r  if  and  only 
if  every  (n  -  r)x  /  submatrix  of  Ynl  has  rank  /. 
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*  +  l 

*1  “  L  ail 

>1 

(A7) 

i-  2 

[l] 

where  a,  e  C  and  not  all  a/s  are  zero.  Let 

12) 

P~Sp[b,,b2,- ■ 

[3] 

Ri  *  Sp[f*i.  b2,-  • 

•Al 

W 

^2  *  Sp  [  b2 ,  &3 ,  •  • 

■  A+il 

[5] 

where  Sp[x,,- •  *,xR]  is  the  notation  for  the  subspace 
spanned  by  {x„x2,- • -,x,>.  It  follows  from  (A7)  that  [6] 
dim  P  <  k,  and  from  (A4)  that  dim  /?,  =  £=-  dim  R2.  Since 


PoR{  md  PdR2,  [7] 

R^R2~P. 

[81 

Therefore,  dim  /l,  n  R2  -  k,  which  contradicts  (A5). 

Q.E.D.  « 
[10] 

B.  The  Proof  of  Theorem  3 


The  proof  of  Theorem  3  follows  immediately  from 
Lemma  B2  and  Theorem  2.  Let  P,  Q  e  C"x";  let  P  and  Q 
be  partitioned  in  the  following  way; 


P 

Q 


Pu  Pi  2 

Pn  Pn 

Qu  Qn 
Qn  Qn 


(Bl) 


I"! 

[12] 

[131 

[14) 

[15] 


where  e  CmXK,  Qn  e  C*xm,  *  <  m  and  m  +  /  -  n.  IJfl 
Lemma  Bl:  La  PQ- 1.  Then,  rank  P{[  -  k  if  and  only  * 
if  rank Qu  “  n  -  m.  [18] 

Proof:  First,  consider  m-  k.  See  [23]. 

Next,  consider  the  case  m  >  k. 
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ABSTRACT 


Note  that  Yb,  Yn  need  not  be  synetrlcal,  end: 


The  topological  condition*  for  lc-nodef  k-cut 
fault,  and  k-rank  branch-set  fault  testability  by 
single  test  signal  vector  In  analog  drculta  are 
given.  These  conditions  which  are  both  necesaary 
and  sufficient  are  esaentlal  in  the  choice  of  teat 
points.  These  conditions  also  have  appllcationa 
to  the  dealgn  of  testable  circuits.  The  outcome 
of  these  methods  can  be  further  dlagnoaed  by  the 
uaage  of  multiple  test  signal  vectors.  This  la 
also  discussed  and  the  associated  problems  are 
studied. 


I.  INTRODUCTION 

Let  N  (Fig.  1)  be  a  connected  b-branches, 
(n+l)-nodes,  (m+l) -terminals,  linear,  time  invari¬ 
ant,  lumped  non-reciprocal  network  (Fig.  1).  Out 
of  the  (n+1) -nodes,  (m+l)  are  accessible  terminals 
for  exltatlon  and  measurement.  Label  the  nodes  In 
the  following  way.  Let  one  of  the  accessible 
nodes  be  the  reference  node.  Then  label  the  rest 
of  the  accessible  nodes  (1  *  l,...,m)  ahead  of  in¬ 
accessible  nodes  (1  -  m+l,...,n).  The  following 
notations  are  used  for  N. 


vB  -  terminal  (accessible  node)  voltage  vector. 
la  -  terminal  current  vector, 
vf,  -  branch  voltage  vector, 
lb  -  branch  current  vector. 
v„  -  nod*  (Including  accessible  node)  voltage 
vector. 

Yfc  -  branch-admittance  matrix. 

Y„  -  node-admittance  matrix. 

Mq  -  the  set  of  all  accessible  nodes. 

M  -  the  set  of  all  accessible  nodes  except  the 
reference  node. 


Fig.  1 


*This  research  Is  supported  In  part  by  ONR 
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lb  -  Tb  vb 

(1) 

Yn  -  A  Yb  AT 

(2) 

where  A  Is  the  Incidence  matrix  of  the  graph. 

Now  consider  the  network  N  Is  perturbed  to 
(N+AN)  In  the  way  that  Yfc  Is  perturbed  to  Yb+AYb, 
and  the  graph  remains  the  same.  In  the  perturbed 
network,  we  denote  the  corresponding  voltages, 
currents  and  admittance  by  va+Ava,  la+Ala,  v^-Mv^, 
lfc+Alb,  v„+Av„  and  Yn+AY„  respectively. 

Given  Yfe  and  the  graph,  the  purpose  of  the 
fault  diagnosis  Is  to  estimate  AYb  from  the  Infor¬ 
mation  lg,  dva  and  Ai,.  va  can  be  calculated  from 
1*  and  N.  Without  loaa  generality,  we  may  aaaume 
that 


Ala  -  0 


(3) 


l.e.,  apply  same  1,  to  the  original  network  N  and 
the  perturbed  network  N+AN. 

Denote  Z„  and  partition  it  In  the  following 

way: 


Zn  -  Tn-1 


Zmn 

Zln 


(4) 


where  Z^  e  C^*n,  Z*„  c  Ctxn,  and  1  •  n-m.  Under 
thla  condition.  It  can  be  shown  that 

Zan  A(AYb>(vb  ♦  Avb)  -  Ava  (5) 

From  Eq.  (5),  assuming  the  number  of  nonsero 
elements  In  AYb  is  small.  It  suggests  that  on*  can 
determine  the  locations  of  nonsero  elements  of  AYb 
and  therefore  the  location  of  faulty  branches  pro¬ 
vided  that  the  number  of  accessible  nodes  Is  large 
enough.  This  discovery  was  mad*  by  Blernackl  and 
Sandier  (1,21  and  Sakla  at  al.  (3].  The  advan¬ 
tages  of  this  approach  are  that  on*  can  rely  only 
on  the  parameters  of  N  (or  the  nominal  circuit) 
and  the  measurements  to  diagnose  the  faulty  cir¬ 
cuit  and  Che  computation  Involved  Is  all  linear. 

However ,  there  are  certain  inevitable  struc¬ 
tures,  such  as  a  small  loop.  In  the  practical  clr- 
culta  which.  If  not  carefully  taken  care  of,  would 
mesa  up  the  results  of  this  approach.  In  the  next 
section,  we  will  discuss  the  systematical  ways  of 
materialising  this  approach  and,  nor*  Importantly, 
provide  the  graphical  conditions  for  placing  the 
test  points. 
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For  the  purpose  of  simplicity ,  all  the 
reeulta  presented  here  are  for  the  caae  when  N 
contains  no  dependent  sources.  Extensions  of 
necessary  conditions  and  sufficient  conditions  to 
include  dependent  sources  are  given  In  (4], [SI, [6] 
and  [10!. 

II.  MODE- P AULT,  CUT-FAULT,  AND  BRANCH-SET 
FAULT  DIAGNOSIS 

Is  this  section,  we  will  present  three  dif¬ 
ferent  methods  of  systematically  Isolating  the 
faults  In  analog  circuits  from  a  single  test  aig- 
nsl  vector.  These  three  methods,  node-fault,  cut- 
fault,  and  branch-set  fault  diagnosis,  represent 
three  different  ways  of  grouping  the  branches, 
they  will  he  concisely  discussed  subsequently  and 
the  graphical  conditions  for  placing  the  teat 
points  in  each  case  will  be  summarized . 

First,  let  us  define  a  faulty  branch  and  a 
faulty  node. 

Definition  1:  Branch  k,  k  -  t,2,...,b  la  said  to 
be  fault-free  If  the  k-th  row  of  AYfe  Is  zero. 

Mode  J,  J  -  l,2,...,n.  Is  said  to  be  fault-free  If 
all  branches  Incident  with  node  J  are  fault-free. 
Otherwise  they  are  faulty . 

Then,  k-node-fault  testability  can  be  defined 
as  follows. 

Definition  2:  A  network  N  Is  said  to  be  k-node- 
fault  testable  if  when  N  la  perturbed  to  (IW-AN), 
one  will  he  able  to  determine,  by  choosing  one 
appropriate  test  signal  vector  iat  from  the  meas¬ 
urement  on  accessible  nodes  Mg: 

(A)  whether  or  not  N  has  no  more  chan  k  node¬ 
faults, 

(g)  If  affirmative,  the  faulty  nodes  can  be 
uniquely  located. 

The  following  theorem  provides  a  graphical 
condition  for  k-node-fault  testability.  To  state 
Che  theorem,  first  let  us  construct  a  modified 
graph  Gq  from  N  by  daletlng  all  branches  Incident 
with  the  reference  node  and  connecting  the  rest  of 
accessible  nodes  to  a  new  node  a,,.  Then  the 
theorem  follows. 

Theorem  1  [101:  The  following  three  statements 
are  equivalent: 

(1)  N  is  k-node-fault  testable  w.r.t.  M  for 
almost  all  values  of  T&. 

(2)  There  ere  at  least  (k+1)  disjoint  paths1 
between  at y  inaccessible  node  and  On  In  G|,. 

(3)  There  are  at  least  local  (k-H) -connectivity^ 
la  Gd  between  any  Inaccessible  node  and  an. 

When  N  is  k-node-fault  testable,  the  proce¬ 
dure  of  locating  the  faulty  nodes  is  given  in  [41 
and  [101. 

1.  Two  paths  are  said  to  be  disjoint  If  there  are 
no  cnmmnn  nodes  except  the  two  end  nodes  (81. 

2.  The  local  connectivity  of  two  nodes  la  the 
minimum  number  of  the  nodes  separating  them 
(91. 


A  node  fault  la  a  way  of  grouping  branchea 
incident  with  the  node.  An  extenaion  is  the  group 
of  branchea  by  cuts.  A  cut  is  a  cut-set  or  a  dis¬ 
joint  union  of  cut-sets  in  a  graph  G  (7).  For  a 
given  set  of  t  cuts,  there  la  a  corresponding  cut 
matrix  H  c  Rtxl>  whoae  (l,J)th  entry  hjj  »  1  if 
branch  J  is  in  cut  1  and  hi«  -  0  If  branch  J  is 
not  In  cut  1.  A  set  of  basis  cuts  0  In  G  la  the 
set  of  n  cuts  whose  corresponding  cut  matrix  haa 
rank  n  where  n  Is  the  number  of  nodes  minus  one. 
Then  the  following  definitions  can  be  stated. 

Definition  3;  A  cut  Is  said  to  be  fault-free  If 
all  branches  contained  In  this  cut  are  fault-free. 
Otherwise,  it  la  faulty. 

Definition  4:  For  a  given  set  of  basis  cuts  Q,  a 
network  N  is  said  to  he  k-cut-fault  testable  if 
when  N  la  perturbed  to  N+AN,  one  will  be  able  to 
determine,  by  choosing  one  appropriate  test  signal 
vector  1„,  from  the  measurements  on  M„: 

(A)  whether  or  not  N  has  no  more  than  k  cut- 

faults  in  Q. 

(B)  if  affirmative,  the  faulty  cuts  can  be 

located  uniquely . 

With  a  mild  condition,  it  can  be  shown  [S] 
that.  In  a  circuit  N,  k-cut-fault  testability  for 
any  given  set  of  basis  cuts  is  equivalent  to  k- 
node-fault  testability,  and  that  the  faulty  cut 
can  be  uniquely  determined. 

For  a  given  set  F  of  faulty  branchea,  the 
number  of  faulty  cuts  will  be  different  for  dif¬ 
ferent  choices  of  set  of  basis  cuts.  The  follow¬ 
ing  theorem  shows  how  to  choose  the  set  of  basis 
cuts  so  that  it  contains  fewest  faulty  cuts. 

First,  let  F  be  the  set  of  faulty  branches  In  N 
and  k'  be  the  number  of  nodes,  excluding  the  ref¬ 
erence  node,  incident  with  F.  Since  F  may  not  be 
connected,  let  F  consist  of  r  separated  parts. 

Theorem  2  [31 ;  For  a  given  F  In  a  circuit  N  there 
exists  a  set  of  basis  cuts  Qp  which  has  only  (k'~ 
r)  faulty  cuts.  Furthermore,  aiqr  other  sec  Q  has 
at  least  (k'-r)  faulty  cuts.  In  other  words,  (k'~ 
r)  is  the  lowest  bound  of  faulty  cuts  in  any  Q. 

Remark:  Theorem  2  haa  the  following  Implications. 
it  N  Is  k-node-fault  testable,  then  for  any  fault 
pattern  F  for  which  (k"-r)  <  k,  there  exists  Op 
which  can  uniquely  locate  the  n-(k'-r)  fault-free 
cuts.  F  will  be  contained  In  the  remainder  of  the 
graph.  Since  there  are  no  more  than  C(b,k)  number 
of  fault  patterns  for  which  lk'-r)  <  k,  we  need 
only  to  use  no  mors  than  C(b,k)  number  of  Q's  to 
cover  all  such  fault  patterns.  The  construction 
of  0y  Is  given  In  [S]. 

Finally ,  let  us  consider  another  method  of 
fault  Isolation,  namely  k-rank  branch-set  diagno¬ 
sis.  A  k-rank  branch-set  Is  a  subgraph  of  N  which 
consists  of  (1)  k  branches  containing  no  loops  and 
(2)  all  of  those  branches  each  of  which  forms  a 
loop  with  branches  in  ( 1 ) .  Since  the  incident 
aatrlx  of  a  k-rank  branch-set  has  rank  k,  the  name 
follows.  Then  the  following  definition  cen  be 
given. 

Definition  5:  A  k-rank  branch-set  is  said  to  be 
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fault-free  If  all  branehea  contained  In  thla 
branch-act  arc  fault-fraa.  Otherwise,  It  la 
faulty . 

Evidently,  given  a  set  of  faulty  branches  In 
NTAN,  there  nay  be  nany  k-rank  branch-sets  which 
all  contain  these  faulty  branches.  However,  there 
exists  a  unique  k°-rank  branch-set  In  N+AN  such 
that  If  another  k-rank  branch-set  also  contains 
all  the  faulty  branches  then  k  >  k°.  Define  this 
k°-rank  branch-set  as  minimum-rank  faulty  branch- 
set.  The  goal  of  k-rank  branch-set  diagnosis  Is 
to  locate  this  uinlaua-rank  faulty  branch-set.  It 
can  be  accomplished ,  If  the  condition  given  in 
Theorem  3  Is  satisfied,  by  a  procedure  similar  to 
that  In  ( 1,2,31 .  However,  because  of  our  grouping 
of  branch-sets,  the  unnecessary  repit it ion  of 
computlons  resulting  from  Indistinguishable  fault 
patterns  la  eliminated.  The  detail  can  be  found 
in  (6). 

Next,  we  will  give  the  definition  of  k-rank 
branch-set  fault  testability . 

Definition  6:  A  network  N  Is  said  to  be  k-rank 
branch-set  fault  testable  If  when  N  la  perturbed 
to  Nt4N,  one  will  be  able  to  determine,  by  choos¬ 
ing  one  appropriate  test  signal  vector  1*.  from 
the  measurements  on  MgS 

(A)  whether  or  not  there  exists  a  k-rank  branch- 
set  which  contains  all  faulty  branches  in  N, 

(B)  If  affirmative,  the  minimum-rank  faulty 
branch-set  can  be  uniquely  determined.  The 
minimum-rank  Is  no  greater  than  k. 

As  In  the  caee  of  k-node-fault  testability , 
there  Is  a  graphical  condition  for  k-rank  branch- 
set  fault  testability.  To  state  the  theorem,  we 
need  to  introduce  another  modified  graph  Cfe  of  N. 
Thla  graph  G(,  is  constructed  from  N  by  connecting 
all  the  nodes  in  Mg,  l.e.,  all  the  accessible 
nodes  Including  the  reference  node,  to  a  new  node 
Notice  that  the  difference  In  dealing  with 
the  reference  node  here  and  the  case  of  k- 
n ode-fault  testability. 

Theorem  3  [61;  The  following  three  stateawnts  are 
equivalent : 

(1)  N  la  k-rank  branch-set  fault  testable 
w.r.t.  %  for  almost  all  values  of  Tj>. 

(2)  There  are  at  least  (k+2)  disjoint  paths 
between  any  Inaccessible  node  and  aj,  In 
cb* 

(3)  There  are  at  least  local  (k+2)- 
connectlvlty  In  C|,  between  any  Inacces¬ 
sible  node  and 

As  a  conclusion  of  this  section,  let  us  in¬ 
vestigate  the  relationship  between  k-node-fault 
testability  and  k-rank  branch-set  fault  testabil¬ 
ity.  A  relationship  for  a  given  Hg  in  N  can  be 
summed  up  In  the  following  two  theorems. 

Hyron  A  16]  t  for  eny  given  set  Hg  of  accessible 
nodes.  If  M  is  k-rank  branch-sat  fault  testable, 
then  N  Is  k-node-fault  testable  w.r.t.  any  refer¬ 
ence  node  In  Hg. 

The  converse  of  Theorem  A  Is  not  true  In 
general.  However,  If  N  satisfies  a  structural 


condition,  then  the  converse  is  also  true.  This 
Is  stated  in  the  following  theorem. 

Theorem  5  {61 :  Suppose  that  there  is  an  aceessl- 
ble  node  mg  in  Hg,  which  is  adjacent  to  every  in¬ 
accessible  node.  Then  k-rank  branch-set  fault 
testability  is  equivalent  to  the  k-node-fault 
testability  if  mg  Is  used  as  the  reference  node. 

Ill.  MULTIPLE  TEST  SIGNAL  VECTORS 

From  the  discussions  In  the  previous  section. 
It  Is  clear  that  the  fault  Isolation  methods  with 
single  test  signal  vector  can  only  isolate  the 
faults  into  a  subcircuit  but  are  unable  to  further 
identify  the  faults.  In  this  section,  we  will 
discuss  how  to  take  advantage  of  the  multiple  test 
signal  vectors  In  fault  diagnosis.  In  the  follow¬ 
ing  discussion,  we  only  state  the  theorems  con¬ 
cerned  with  k-rank  branch-set  diagnosis.  The  re¬ 
sults  concerning  the  other  two  cases  are  similar 
sad  therefore  are  omitted. 

To  begin  with,  let  us  Introduce  some  more 
notations.  A  set  of  branches  Is  acyclic  if  It 
contains  no  loops.  .  An  acyclic  t-branch  in  N  is 
excitable  w.r.t.  **0  If  the  branch  voltage  vectors 
of  these  t  branches  can  be  generated  to  span  a  t- 
dlmenslonal  space  by  appropriately  applying  mul¬ 
tiple  test  signal  vectors  on  Mg.  If  every  acyclic 
t-branch  in  N  Is  excitable  w.r.t.  Mg,  then  N  is 
said  to  be  t-acycllc-branch  excitable.  With  these 
definitions,  we  can  state  the  following  theorems 
[61. 

Theorem  6:  Suppose  that  N  is  k-rank  branch-set 
fault  testable  and  H+AN  is  k-acydlc-branch 
excitable.  Then  all  the  faulty  branches  can  be 
uniquely  determined  by  k  Independent  multiple  test 
signals. 

Theorem  7:  N  Is  (k-l)-rank  branch-set  fault 
testable  If  and  only  If  N  Is  k-acydlc-branch 
excitable. 

Theorem  8:  If  N  Is  k-acydlc-branch  excitable  and 
NT  AN  ha*  the  same  graph  as  N  then  NTAN  Is  k- 
acydlc-branch  excitable  for  almost  all  AYfe. 

In  summary ,  suppose  that  H  la  k-rank  branch- 
set  fault  testable.  Then  we  can  determine  whether 
a  sat  of  fault  branches  can  be  contained  In  one  of 
the  k-rank  branch-sets  by  a  single  test  signal 
vector  (Theorem  3).  Furthermore,  If  affirmative, 
we  can  uniquely  determine  the  minimum-rank  branch- 
set  which  contains  the  faulty  branches  by  a  single 
test  signal  vector  (Theorem  3)  and  we  can  uniquely 
determine  the  faulty  branches  by  a  k  multiple  test 
signal  vectors  If  the  conditions  of  Theorem  8  are 
fulfilled. 

Similar  results  can  be  shown  for  node-fault 
and  cut-fault  diagnosis. 
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inefe  are  tnree  Branch  faults  (a,b,c)  but  only  two  node  almost  all  Ao„.  Furthermore,  all  faulty  nodes  can  be 
faults  (1,2).  uniquely  isolated  by  ( 1 3). 
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